You have chosen to sponsor your bid up to a maximum amount of .
We need a skilled Python coder to write a script that is going through search result pages, parses the DOM tree and gives us a JSON stream of the objects like headings, links, images etc. (See attachment!)
It is important that we go through the pages on a page by page (10 results per page) basis and extract as much information as possible in a structured way. For an example, have a look at the attachment.
I am a Python guy myself but am caught up in plenty of work. So please let me know how you would tackle this so I can make sure you know what you're talking about. Knowledge of mechanize, beautifulsoup and NoSQL databeses are a plus. Write me for more infos and let me know how long you'd need to finish such a job.