In Progress

Data Processing/Scraping from Standard Format txt Files

Hi, we are looking to hire someone to manipulate already existing data files (will be given web link) that are in a standard .txt file format with numeric and text entries to a format used for computing.

1) We would like you to start with taking 100 of the entries (randomly selected with random number generator) in one of the 30 files we will give you.

2) We would like you to transform these 100 entries into a matrix in .csv form based on pre-specified categories given by us. Two of the columns are word and word count. Another is entry ID.

3) We also would like a sparse representation of the two columns of word and word count where there is a new matrix (rows are entry #, columns are word label - filled with the count) and that depends on size of file. We can talk about this.

4) The deliverable should be in manageable csv file sizes, which won't be a problem for this data...

But, we will definitely have more work if this is done successfully (over all files and more entries needed), so scalable routines are highly encouraged. Thinking about a million entries with a higher budget, if this goes well.

Thank you very much.

Please note that we will only hire someone who has the ability to do this automatically since we are looking for FUTURE work primarily. This is just a pilot.
Once we go from 100 entries to 1 million, manual typing will not work. We realize that file size will be an issue depending on the matrix, so if things eventually need to be broken apart into let's say 1000 files of 1000 entries, we will then use this with parallel computing routines for our computations. Thank you so much and we look forward to working with you.

Skills: Big Data, Data Entry, Data Mining, Data Processing, Web Scraping

See more: web scraping hire, sparse matrix in c, hire a web scraping, another word for data entry, transform word, standard, Data Processing, Data matrix, text file scraping, data scraping word, standard work format, sparse, csv files processing, data matrix generator, random data, csv data generator, text file generator csv, talk random, csv scraping, common data format standard browsable representation irrespective format, data transform, numeric entry work, random budget generator, count word files, random matrix

About the Employer:
( 3 reviews ) Portland, United States

Project ID: #5006785

Awarded to:


Hi, I specialize in automated data extraction, processing and analysis using my own scripts. On my feedback page you'll find many examples of such projects I've completed here on Freelancer. I am currently the top prov More

$250 USD in 2 days
(158 Reviews)

41 freelancers are bidding on average $144 for this job


Respected sir, We saw project description and got complete idea about project. We are expert in Big Data, Data Entry, Data Mining, Data Processing and Web Scraping!!! We have worked on many similar tasks before and More

$231 USD in 4 days
(61 Reviews)

Dear "statsphd" Hope you are doing well. I have reviewed the project details and would like to offer our services. We have completed many Research/Data collection/Product add/Data mining assignments on [url removed, login to view] More

$151 USD in 3 days
(62 Reviews)

Hello Sir, We are a big set up company with excellent skilled operator who have a lot of experience in this segment, our employee complete more than 300 similar job, i have gone through your project specification, i More

$144 USD in 3 days
(95 Reviews)

Dear Client, I can help in your project. We have already experience of working on similar projects. Please see below to get idea of our experience: Amazon/Ebay Bots: [url removed, login to view] More

$206 USD in 5 days
(53 Reviews)

Hi.. Expert web scraper/Data Minor here. Interested in your project. I assure you 100% accurate and good quality work. Regards

$105 USD in 3 days
(69 Reviews)

Hello Sir, We are a professional company specialized in Data Mining and Web Scraping. We have our own server, team and tools for data mining and scraping efficiently and accurately. We can parse your given text More

$155 USD in 4 days
(54 Reviews)

Hi, I am much interested in this work. Please share me more details with sample text file and describe me what would like to do. I can automate all of the process once I get understood your requirement. Please sha More

$100 USD in 3 days
(20 Reviews)

Hi I'm interested and I like to know more details about your project to bid accordingly. I have experience doing programs and scripts in some projects here and in other freelancer site. I have Skype, Gtalk, MS More

$35 USD in 3 days
(9 Reviews)

Dear Sir We are reday to maipulate your data in [url removed, login to view] have lots of experience in [url removed, login to view] will create macros in excel for your project if [url removed, login to view] lots of scrapping projects. Please check some of our done p More

$200 USD in 7 days
(15 Reviews)

Hi, I am interested to do these project work. Expert in data conversion work. Please send me more details of work to start. Thanks sunny

$35 USD in 2 days
(25 Reviews)

Hello, I am experienced in working with large files and back-end processing in general. I will definitely finish this project in the next 24 hours. I still need some clarifications before getting started, regardi More

$133 USD in 1 day
(32 Reviews)

Hello there. I have high Excel and Visual Basic skills with great professionalism. I study electronics and computer engineering at Oporto university and I'm looking for work to fill the blanks on my schedule. I' More

$60 USD in 3 days
(14 Reviews)

Dear Sir / Madam, I'm a computer engineer (with BS Degree), working freelance in Istanbul, Turkey. I can complete your project as fast & accurate. Please let me know. Looking forward to hearing from you soon, More

$35 USD in 1 day
(14 Reviews)

Dear sir, I have read your requirement carefully and interested in it. I am expert on data entry, data scrapping and process data. I usually to do it automatic. For your project, I think I can automatic by a prog More

$126 USD in 3 days
(13 Reviews)

Hi i have almready do this kind of job. You can see that in my profile. I am ready to start it. I can do that in about one week.

$250 USD in 7 days
(1 Review)

Hi I have handled public data from [url removed, login to view] for years. Typical csv entries for each year count to 10 million for selected output of 3 million entries. I trust csv file itself can handle size efficiently. It is wha More

$166 USD in 7 days
(6 Reviews)

Hello, I'm interested, I'd to give it a try. Can you provide a sample file so I can send you my attempt? No compromises. Also send me any other information I should need to build a proper processing script, I'm t More

$30 USD in 2 days
(1 Review)

Experts in data processing-cum-manipulation with excellent reviews. We have the necessary tools to handle customized requirements such as yours.

$222 USD in 3 days
(1 Review)

Hello - I am an expert techno-functional analyst having vast experience in lots of arenas of IT industry including Excel Macros. I am an Engineering Graduate with an MBA degree. If you see, I am among the niche bid More

$111 USD in 3 days
(5 Reviews)

I am Data Entry ,MS Word and MS Excel Expert. i am very much professional in this work i am pretty sure that you cant find a best person for this job like me so i am ready to work on your project with low rate and high More

$147 USD in 3 days
(4 Reviews)