Web Scraping of news outlets using C++ into NoSQL databases

IN PROGRESS
Bids: 3
Avg Bid (USD): $20 / hr
Project Budget (USD): $2 - $8 / hr

Project Description:
We are looking for a programmer to develop a C++ scraper for financial news blogs. The code should be reasonably commented and should run with parallel threads. The program should:
Authenticate itself (if necessary) on the website
Create a JSON object saving the contents of the article
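To make the "JSON object" requirement concrete, here is a minimal sketch of how one scraped article might be serialized using only the standard library. The `Article` struct and its field names (`source`, `url`, `title`, `body`) are assumptions for illustration; the posting does not specify a schema. A real submission would more likely use an established JSON library.

```cpp
#include <cstdio>
#include <string>

// Hypothetical record for one scraped article; field names are assumptions.
struct Article {
    std::string source;
    std::string url;
    std::string title;
    std::string body;
};

// Escape a string so it can sit inside a JSON string literal.
// Bytes >= 0x80 are passed through unchanged, so input is assumed to be UTF-8.
std::string json_escape(const std::string& in) {
    std::string out;
    out.reserve(in.size() + 8);
    for (unsigned char c : in) {
        switch (c) {
            case '"':  out += "\\\""; break;
            case '\\': out += "\\\\"; break;
            case '\n': out += "\\n";  break;
            case '\r': out += "\\r";  break;
            case '\t': out += "\\t";  break;
            default:
                if (c < 0x20) {
                    // Remaining control characters need the \uXXXX form.
                    char buf[8];
                    std::snprintf(buf, sizeof buf, "\\u%04x",
                                  static_cast<unsigned>(c));
                    out += buf;
                } else {
                    out += static_cast<char>(c);
                }
        }
    }
    return out;
}

// Serialize one article as a compact, single-line JSON object.
std::string to_json(const Article& a) {
    return "{\"source\":\"" + json_escape(a.source) +
           "\",\"url\":\"" + json_escape(a.url) +
           "\",\"title\":\"" + json_escape(a.title) +
           "\",\"body\":\"" + json_escape(a.body) + "\"}";
}
```

Because newlines in the article text are escaped, each object stays on one line, which keeps the output easy to store or stream.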

Some websites that will be scraped are:
The Wall Street Journal - http://online.wsj.com/itp?mod=WSJ_formfactor
Seeking Alpha - http://seekingalpha.com/
The Motley Fool - http://www.fool.com/
More websites are to come, so the script should have generic elements and be easily extensible.
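One possible way to get both the "generic elements" and the parallel execution asked for above is sketched below: a small per-site interface plus a runner that starts one asynchronous task per site. Everything named here (`SiteScraper`, `TitleTagScraper`, `Fetcher`, `scrape_all`) is hypothetical. The HTTP download and any login step are deliberately left behind the injected `Fetcher` function, since the posting does not say which HTTP library or authentication scheme to use.

```cpp
#include <functional>
#include <future>
#include <memory>
#include <string>
#include <utility>
#include <vector>

// Hypothetical, trimmed-down article record for this sketch.
struct Article { std::string source, url, title; };

// One subclass per outlet; supporting a new site means adding a subclass.
class SiteScraper {
public:
    virtual ~SiteScraper() = default;
    virtual std::string name() const = 0;
    virtual std::string start_url() const = 0;
    // Site-specific extraction. A real version would use an HTML parser.
    virtual Article parse(const std::string& html) const = 0;
};

// Illustrative scraper that only reads the text between <title> tags.
class TitleTagScraper : public SiteScraper {
public:
    TitleTagScraper(std::string name, std::string url)
        : name_(std::move(name)), url_(std::move(url)) {}
    std::string name() const override { return name_; }
    std::string start_url() const override { return url_; }
    Article parse(const std::string& html) const override {
        const std::string open = "<title>", close = "</title>";
        std::string title;
        auto b = html.find(open);
        if (b != std::string::npos) {
            b += open.size();
            auto e = html.find(close, b);
            if (e != std::string::npos) title = html.substr(b, e - b);
        }
        return Article{name_, url_, title};
    }
private:
    std::string name_, url_;
};

// Assumed download hook: takes a URL, returns the page body.
// Authentication, if a site needs it, would live inside this function.
using Fetcher = std::function<std::string(const std::string& url)>;

// Fetch and parse every site concurrently, one task per site.
// Results come back in the same order as `sites`.
std::vector<Article> scrape_all(
        const std::vector<std::unique_ptr<SiteScraper>>& sites,
        const Fetcher& fetch) {
    std::vector<std::future<Article>> jobs;
    for (const auto& site : sites) {
        const SiteScraper* s = site.get();
        jobs.push_back(std::async(std::launch::async, [s, &fetch] {
            return s->parse(fetch(s->start_url()));
        }));
    }
    std::vector<Article> out;
    for (auto& j : jobs) out.push_back(j.get());
    return out;
}
```

Injecting the fetcher keeps the parsing logic testable without network access: a stub that returns canned HTML can stand in for the real download. On some toolchains the program may need to be built with `-pthread`.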

The results will be in JSON structure, preferably inserted into a MongoDB instance (CouchDB may also be used) or, for testing purposes, written to JSON files.
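For the testing-purposes option, a minimal sketch of the file output could look like the following. It assumes each article has already been serialized to a compact, single-line JSON string, and writes one document per line. The function names and the file layout are assumptions; newline-delimited JSON is, as far as I know, also a shape that MongoDB's `mongoimport` tool can read, but that should be confirmed against its documentation.

```cpp
#include <fstream>
#include <string>
#include <vector>

// Write one JSON document per line, replacing any existing file.
// Each entry in `docs` is assumed to contain no raw newline characters.
// Returns false if the file could not be opened or written.
bool write_json_lines(const std::string& path,
                      const std::vector<std::string>& docs) {
    std::ofstream out(path, std::ios::out | std::ios::trunc);
    if (!out) return false;
    for (const auto& d : docs) out << d << '\n';
    out.flush();
    return static_cast<bool>(out);
}

// Read the file back line by line, e.g. to verify a test run.
std::vector<std::string> read_lines(const std::string& path) {
    std::vector<std::string> lines;
    std::ifstream in(path);
    std::string line;
    while (std::getline(in, line)) lines.push_back(line);
    return lines;
}
```

When several threads produce articles, collecting the strings first and writing them from a single thread avoids interleaved output in the file.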

Additional Project Description:
11/18/2013 at 11:31 EST
We will accept solutions in a different language, provided they run in parallel.

Hours of work: 3 hr / week
Project Duration: 1 - 4 weeks
Skills required: C++ Programming, node.js, NoSQL Couch & Mongo, Python, Web Scraping
Project posted by:
prosoftwarepack United States
Verified


$12 / hr
Hours: 15 hr/ week
Hire jibyjose001
$10 / hr
Hours: 60 hr/ week
Hire julianrath
$38 / hr
Hours: 5 hr/ week