You have chosen to sponsor your bid up to a maximum amount of .
We are looking for a programmer to develop a c++ scraper for financial newsblogs. This should be reasonably commented, and run with parallel threads. The program should:
Authenticate itself (if necessary) on the website
Create a JSON object saving the contents of the article
Some websites that will be scraped are:
The Wall Street Journal -http://online.wsj.com/itp?mod=WSJ_formfactor
Seeking Alpha - http://seekingalpha.com/
The Motley Fool - http://www.fool.com/
..more websites are to come, so the script should have generic elements and be easily extensible
The results will be in JSON structure, preferably inserted into a mongoDB instance (couchDB may also be used), or for testing purposes json files.
Additional Project Description:
11/18/2013 at 11:31 EST
We will accept solutions in a different language if they are run in a parallel fashion.