I am looking for someone who can code a crawler for me. The crawler should scan two websites and store certain information from them in CSV files. The information to be crawled is plain text, but it is placed differently on the two websites (within each website, however, the placement is always the same). The crawler should go through the sites and grab all of that information. It should be possible to select which website is crawled. For each website, the crawler should generate CSV files according to the criteria I will give you. The crawler must not generate duplicate entries.
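To illustrate the site selection and the no-duplicates requirement, here is a minimal sketch in Python, not a finished implementation. The site names (`site_a`, `site_b`), the column names and the helper functions are hypothetical placeholders; the real columns depend on the criteria that will be supplied.

```python
import csv
import io

# Hypothetical per-site settings: the real field lists depend on the
# criteria supplied for each website.
SITES = {
    "site_a": {"fields": ["title", "price"]},
    "site_b": {"fields": ["name", "description"]},
}


def write_unique_rows(rows, out, fieldnames):
    """Write rows as CSV to `out`, skipping rows that were already written.

    Two rows count as duplicates when all listed fields match after
    surrounding whitespace is stripped. Returns the number of rows written.
    """
    writer = csv.DictWriter(out, fieldnames=fieldnames)
    writer.writeheader()
    seen = set()
    written = 0
    for row in rows:
        key = tuple(row.get(name, "").strip() for name in fieldnames)
        if key in seen:
            continue  # duplicate entry, do not write it again
        seen.add(key)
        writer.writerow(dict(zip(fieldnames, key)))
        written += 1
    return written


def export_site(site_name, rows, out):
    """Export the rows of the selected site using that site's column list."""
    if site_name not in SITES:
        raise ValueError(f"unknown site: {site_name!r}")
    return write_unique_rows(rows, out, SITES[site_name]["fields"])
```

For example, calling `export_site("site_a", rows, open("site_a.csv", "w", newline=""))` would write only the distinct rows for the first site. To avoid duplicates across several runs, the set of seen keys would also have to be loaded from the existing CSV file first, which this sketch does not do.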
Additional Project Description:
02/18/2013 at 23:48 CST
The project was already posted once but was cancelled. The problem was that the content structure of the websites is not regular: some closing HTML tags are missing. So please only bid if you can address that problem.
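One possible way to cope with missing closing tags is to use an event-based parser that does not require well-formed markup. The sketch below uses `html.parser.HTMLParser` from the Python standard library and treats the start of the next cell as the end of the previous one. The tag name `td` and the class and function names are assumptions for illustration only; the actual markup of the two websites is not known here.

```python
from html.parser import HTMLParser


class CellTextParser(HTMLParser):
    """Collect the text of each `tag` element, even when its end tag is missing."""

    def __init__(self, tag):
        super().__init__(convert_charrefs=True)
        self.tag = tag
        self.cells = []
        self._current = None  # text pieces of the cell currently open

    def _flush(self):
        # Store the open cell (if any) with whitespace normalized.
        if self._current is not None:
            text = " ".join("".join(self._current).split())
            if text:
                self.cells.append(text)
        self._current = None

    def handle_starttag(self, tag, attrs):
        if tag == self.tag:
            self._flush()  # a new cell implicitly closes the previous one
            self._current = []

    def handle_endtag(self, tag):
        if tag == self.tag:
            self._flush()

    def handle_data(self, data):
        if self._current is not None:
            self._current.append(data)

    def close(self):
        super().close()
        self._flush()  # keep a cell that was still open at end of input


def extract_cells(html, tag="td"):
    """Return the text of every `tag` element found in `html`."""
    parser = CellTextParser(tag)
    parser.feed(html)
    parser.close()
    return parser.cells
```

With this approach, `extract_cells("<table><tr><td>Alpha<td>Beta</td><td>Gamma</table>")` yields the three cell texts although two `</td>` tags are missing. A known limitation of the sketch: any text that follows an unclosed cell is attributed to that cell until the next start tag of the same kind appears.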