In Progress

File parser to extract information and create report

Below is the description of the program I want to be implemented.

############### The algorithm ##########################

Initialization phase: Load the attributes from the “[url removed, login to view]” to the memory.

Repeat every “parsing_interval” seconds

Copy the next “oldest” file (say “[url removed, login to view]”) from the “inbox_dir” directory to the “work_dir” directory (the oldest file can be determined from the file name; the file name convention is described below)

Unzip the [url removed, login to view] file (located in the “work_dir” directory). It will result in a text file, say “[url removed, login to view]”

Parse the “[url removed, login to view]” as described in the section “Parsing Procedure below” (and add parsing results to the report file).

Delete both “[url removed, login to view]” and “[url removed, login to view]” from the “work” directory.

############## The Parsing Procedure #####################

While not EOF do

Read next line from the file “[url removed, login to view]”

If the value of the “Primary attribute” is equal to ANY of the possible values listed in the “[url removed, login to view]” then do the following

If the value of of each secondary attribute is equal to ANY of the possible values listed in the "[url removed, login to view]" then

Get ALL the values associated with the secondary attributes (loaded from the [url removed, login to view] )

Append the primary attribute name, primary attribute value, and all secondary attribute names and values to the “report_file” (If the report_file is not present, then create a new one)

############## The Config File ##########################

# time interval for parsing the next file in seconds

parsing_interval: 60

# Report lifespan in days

Report_life: 30

# Directory paths

Inbox_dir: C:\Users\jsmith\Documents\inbox_dir

Work_dir: C:\Users\jsmith\Documents\work_dir

Report_dir: C:\Users\jsmith\Documents\report_dir

# Primary Attribute

Attribute2: value2_1, value2_2, value2_3

# Secondary Attribute Lists

Attribute1: Value1_1, Value1_2

Attribute3: Value3_1, Value3_2, Value3_3

############### Important notes ##########################

1) The “gzip” files are very huge (It could be 400MB) so the code should be scalable

2) The code should run on both Windows and Unix/Linus environments.

3) The files in the “inbox_dir” will not be deleted or moved.

4) The files in the “inbox_dir” have the following naming convention: YYYYMMDD-SEQNO_*.gzip.

############ Sample contents of the “[url removed, login to view]” file ########################

timestamp attribute1:value1 attribute2:value2 attribute3:value3 attribute4:value4 attribute5:value5 attribute6:value6 ..... attributen:valuen

mestamp attribute1:value1 attribute2:value2 attribute4:value4 attribute5:value5 attribute6:value6 ..... attributen:valuen

timestamp attribute1:value1 attribute2:value2 attribute3:value3 attribute4:value4 attribute5:value5 attribute6:value6 ..... attributen:valuen

timestamp attribute1:value1 attribute2:value2 attribute4:value4 attribute5:value5 attribute6:value6 ..... attributen:valuen

timestamp attribute1:value1 attribute2:value2 attribute3:value3 attribute4:value4 attribute5:value5 attribute6:value6 ..... attributen:valuen

timestamp attribute1:value1 attribute2:value2 attribute3:value3 attribute5:value5 attribute6:value6 ..... attributen:valuen

timestamp attribute1:value1 attribute2:value2 attribute3:value3 attribute5:value5 attribute6:value6 ..... attributen:valuen

timestamp attribute1:value1 attribute2:value2 attribute3:value3 attribute4:value4 attribute5:value5 attribute6:value6 ..... attributen:valuen

##########################################################

1) Please let me know if you have any questions. I’ll send you a sample gzip file via private message so that you can test the application on real data.

2) If you respond to this request, please tell me the language you'll use and the approximate timeframe you'll need to do this and the approximate cost and I'll send you more details.

Please let me know if you have any questions.

Skills: C Programming, Data Processing, Java, Perl, Python

See more: unix programming language, report programming, programming test questions, programming and algorithm, programming algorithm questions, parse programming, oldest programming language, create a programming language, c algorithm questions, algorithm sample code, algorithm programming questions, algorithm and programming, abc programming language, abc programming, perl parser, gzip, extract information to, eof, create report, notes create section, parser report, application parsing text, perl read config, append line file, cost file

About the Employer:
( 167 reviews ) Burlingame, United States

Project ID: #4302168

Awarded to:

ShawarmaMan

Hi, I have over 10 years of experience with Perl and I'd be happy to help you with this. Please see my PM for more details.

$200 USD in 3 days
(0 Reviews)
0.0

35 freelancers are bidding on average $184 for this job

samitXI

Please check your inbox. Thanks

$315 USD in 7 days
(179 Reviews)
7.2
dobreiiita

Hi, I am JAVA expert and can surely help you here, Please share sample gzip file. Thanks

$400 USD in 10 days
(381 Reviews)
7.4
gangabass

I can do this for you. See PMB for details.

$100 USD in 2 days
(280 Reviews)
6.5
wbslivera

hello, I can help you. Thanks

$150 USD in 7 days
(209 Reviews)
6.2
hbxfnzwpf

I am c++ expert, I can handle this, please check you private message.

$125 USD in 2 days
(78 Reviews)
6.4
shupantha

I have over 10 years of experience in applications development in C++. Please check your PM for my comments on your project.

$250 USD in 7 days
(42 Reviews)
6.2
atchayagandhi

Hi, The project is easily realisable with java. We have expertise with the same. Request you to kindly refer your private message board for further details.

$60 USD in 2 days
(47 Reviews)
6.3
mantislin

Hi sir, please check PM, thx Kimi.

$317 USD in 6 days
(56 Reviews)
6.1
idleswell

Hello, I am a Perl scripting expert interested in your project. I will provide details of my bid in a PMB. A IDLER

$189 USD in 6 days
(166 Reviews)
6.1
akhila27

##### YOUR SEARCH ENDS HERE! ##### GET IT RIGHT THE FIRST TIME!. Check the message and contact us. SI TEAM.

$400 USD in 21 days
(21 Reviews)
6.4
sysx

Let's start

$100 USD in 5 days
(18 Reviews)
5.9
buzzcoder

Hi I can help you, please check your pm,thanks.

$400 USD in 4 days
(50 Reviews)
5.7
fejs

Hi Sir. I'm expert in Python programming. I've done similar project recently, so this task shouldn't be problem. Can You send me zip file to test application? Best regards, Fejs.

$50 USD in 2 days
(74 Reviews)
5.7
chinbochen

Hi sir, I'm senior java developer, I'm interested. Thanks.

$350 USD in 20 days
(20 Reviews)
5.9
mjef

I have more than 7 years of professional experience in cross-platform C++ programming. I can help you.

$200 USD in 7 days
(16 Reviews)
5.3
j2eesoafree

Hi I can deliver this ensuring high quality and flexibility using Java. Please let me know any questions, and share a sample file. regards, kD

$400 USD in 20 days
(25 Reviews)
5.5
dipakjadhav08

I have 4 years experience in Java. Also I am Oracle certified for java 6. I can help you.

$100 USD in 1 day
(38 Reviews)
4.6
russianitgroup

Hello, Will be glad to discuss the project. Regards, IT-Group

$170 USD in 5 days
(23 Reviews)
4.6
DenialWang

can do in both java and c#, while c# code can't be used in unix/linux; we may discuss this later. thanks/denial

$100 USD in 3 days
(6 Reviews)
4.3
charleslewisjose

I can help you out with this project.

$250 USD in 7 days
(11 Reviews)
4.2