
Project Description:
Design a web scraper that reads a list of web domains from a CSV file and scans each domain's pages and sub-pages for the presence of the Google Website Optimizer JavaScript tag.
Further details:
- the application should take a CSV list of web domains as input and scan all pages and sub-pages for the presence of the Google Website Optimizer content generation tag. This tag is available by registering at https://www.google.com/analytics/siteopt/splash and setting up a dummy test, or I can provide an example
- the proposed means of detecting the tag must ensure that all cases of the tag are detected; I will take your technical expert view on this matter
- the output should be a list of those domains which include the specified tag, specifying the pages where the tag was found
- programming language used does not matter for this project
- application / script must be able to run on a Windows XP PC
- application must be capable of working from a list of 1 million domains (Alexa top 1m sites list, too large to be attached but can be downloaded / supplied if interested)
- I anticipate that the scan will take some time, so this application must be able to run in an unattended mode and have a pause function. In case of error it should quit without losing progress.
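
As a rough illustration of the requirements above, the following Python sketch shows one way the tag detection and the "quit without losing progress" behaviour might fit together. It is only a sketch under several assumptions: the marker strings (`utmx_section`, `siteopt.js`, `gwo._setAccount`, `gwo._trackPageview`) are what Website Optimizer snippets are believed to contain and should be checked against a real tag; `fetch` is a hypothetical callable that yields `(url, html)` pairs for the pages of a domain (the crawling itself is not shown); and the file names and the `rank,domain` CSV layout are illustrative.

```python
import csv
import io
import json
import os
import re

# Assumed marker strings; verify them against a real Website Optimizer tag.
GWO_MARKERS = re.compile(
    r"utmx_section|siteopt\.js|gwo\._setAccount|gwo\._trackPageview",
    re.IGNORECASE,
)


def has_gwo_tag(html):
    """Return True if the page source contains any assumed marker."""
    return GWO_MARKERS.search(html) is not None


def read_domains(csv_text):
    """Yield domains from CSV text; accepts 'rank,domain' rows or bare domains."""
    for row in csv.reader(io.StringIO(csv_text)):
        if row:
            yield row[-1].strip()


def load_checkpoint(path):
    """Return the index of the next domain to scan (0 if no checkpoint exists)."""
    if os.path.exists(path):
        with open(path) as f:
            return json.load(f)["next_index"]
    return 0


def save_checkpoint(path, next_index):
    """Write the checkpoint to a temp file first, then swap it into place."""
    tmp = path + ".tmp"
    with open(tmp, "w") as f:
        json.dump({"next_index": next_index}, f)
    os.replace(tmp, path)


def scan(domains, fetch, checkpoint_path, results_path):
    """Scan domains in order, appending (domain, url) rows for pages with the tag.

    `fetch` is a hypothetical callable: fetch(domain) -> iterable of (url, html).
    """
    start = load_checkpoint(checkpoint_path)
    with open(results_path, "a", newline="") as out:
        writer = csv.writer(out)
        for i, domain in enumerate(domains):
            if i < start:
                continue  # already handled in an earlier run
            for url, html in fetch(domain):
                if has_gwo_tag(html):
                    writer.writerow([domain, url])
            out.flush()
            save_checkpoint(checkpoint_path, i + 1)
```

With this layout, pausing amounts to stopping the process and starting it again later: the scan resumes at the first domain not yet recorded in the checkpoint. If a run is interrupted partway through a domain, that domain is rescanned on restart, so its pages could appear twice in the results file; deduplicating the output afterwards would handle that.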
© Copyright 2013 Freelancer Technology Pty Limited (ACN 142 189 759)
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)