I need a web crawler that can find websites that use Flash elements.
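To make "Flash elements" a bit more concrete, here is a minimal sketch of one way a page could be checked. It is only an assumption about how detection might work, not a required approach: the names `FlashDetector` and `has_flash` are made up for illustration, and it only looks at `object`, `embed` and `param` tags in the static HTML, so Flash loaded via JavaScript would be missed.

```python
from html.parser import HTMLParser

FLASH_MIME = "application/x-shockwave-flash"


class FlashDetector(HTMLParser):
    """Sets .found when a tag points at a .swf file or the Flash MIME type."""

    def __init__(self):
        super().__init__()
        self.found = False

    def handle_starttag(self, tag, attrs):
        # HTMLParser already lowercases tag and attribute names.
        if tag not in ("object", "embed", "param"):
            return
        for name, value in attrs:
            value = (value or "").lower()
            if name == "type" and value == FLASH_MIME:
                self.found = True
            elif name in ("src", "data", "value"):
                # Ignore a query string such as movie.swf?v=2
                if value.split("?")[0].endswith(".swf"):
                    self.found = True


def has_flash(html):
    """Return True if the static HTML appears to embed Flash content."""
    detector = FlashDetector()
    detector.feed(html)
    return detector.found
```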
Furthermore, it needs to be able to determine the language of each website. I am ONLY interested in finding German and English language websites!
The project strictly requires that I find ONLY websites in German and English. That could cover a whole range of domain endings, such as .de, .at and .ch for German and .com and so forth for English. I definitely don't want websites in any other language (French, Spanish, Italian, Polish, Russian, Japanese, Chinese, etc.).
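As a rough illustration of the language filter, the sketch below first trusts the `lang` attribute on the `<html>` tag and only falls back to the domain ending when that attribute is missing. This is an assumption on my part, not a fixed specification: `guess_language` and `TLD_HINTS` are hypothetical names, the mapping contains only the endings mentioned above, and a real crawler would probably also need to analyse the page text, since domain endings such as .com or .ch are not a reliable indicator of language.

```python
import re

# Matches the primary language subtag in <html lang="de-AT"> and similar.
LANG_ATTR = re.compile(
    r'<html\b[^>]*\blang\s*=\s*["\']?([A-Za-z]{2,3})', re.IGNORECASE
)

# Fallback hints taken from the domain endings named in this posting.
TLD_HINTS = {"de": "de", "at": "de", "ch": "de", "com": "en"}


def guess_language(domain, html):
    """Return "de", "en", or None if the site should be skipped."""
    match = LANG_ATTR.search(html)
    if match:
        lang = match.group(1).lower()
        return lang if lang in ("de", "en") else None
    # No lang attribute: fall back to the (unreliable) domain ending.
    tld = domain.rsplit(".", 1)[-1].lower()
    return TLD_HINTS.get(tld)
```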
The crawler needs to generate a list of domains and their e-mail addresses for each of these languages.
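For the e-mail part, something along these lines is what I have in mind. It is a simplified sketch under the assumption that addresses appear as plain text or `mailto:` links in the HTML; `extract_emails` is a hypothetical helper and the pattern is deliberately loose, so it will not catch obfuscated addresses.

```python
import re

# Loose pattern for typical addresses; not a full RFC 5322 validator.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def extract_emails(html):
    """Return the unique, lowercased e-mail addresses found in the HTML."""
    return sorted({address.lower() for address in EMAIL_RE.findall(html)})
```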
The crawler needs to have a GUI where the above parameters must be adjustable. It must also be able to check against a blacklist, so that if it hits a known domain or e-mail address, that entry is skipped from the start.
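The blacklist check could be as simple as the sketch below. It assumes the blacklist is a plain collection of domains and e-mail addresses compared case-insensitively; `is_blacklisted` is a hypothetical name, and how the list is stored or edited in the GUI is left open.

```python
def is_blacklisted(domain, emails, blacklist):
    """Return True if the domain or any of its e-mail addresses is known."""
    entries = {entry.strip().lower() for entry in blacklist}
    if domain.lower() in entries:
        return True
    return any(email.lower() in entries for email in emails)
```

A crawler would call this before doing any further work on a site and skip the site when it returns True.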
In a second stage, the crawler should find all websites that do not have an SSL certificate.
If the crawler can do all three of the above things, so much the better. But the first stage is to find websites with Flash elements.
I need a minimum of 2 million addresses per language.
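For the SSL stage, a check like the following is one possibility. It is a sketch under assumptions: `has_valid_certificate` is a hypothetical name, it tries a TLS handshake on port 443 with Python's default certificate verification, and it treats every failure (no HTTPS, expired or mismatched certificate, unreachable host) the same way, so a real crawler may want to distinguish those cases.

```python
import socket
import ssl


def has_valid_certificate(host, port=443, timeout=5.0):
    """Return True if a verified TLS handshake with the host succeeds."""
    context = ssl.create_default_context()
    try:
        with socket.create_connection((host, port), timeout=timeout) as sock:
            # The handshake (including certificate checks) happens here.
            with context.wrap_socket(sock, server_hostname=host):
                return True
    except OSError:
        # Covers connection errors, timeouts and ssl.SSLError subclasses.
        return False
```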
I am looking forward to your offers!
53 freelancers are bidding on average $1234 for this job
I have read your requirements: you need a web crawler that can find websites that use Flash elements. Please invite me to chat so we can discuss the details.
Hello, how are you? Yes, I am an expert in web scraping. I have more than 6 years of experience in PHP and can do this scraping in PHP. You can check my work: [login to view URL] Thanks, Shweta