Project ID:
466076
Project Type:
Fixed
Budget:
$30-$250 USD
Project Description:
Hello,
I need software that does the following.
I have a list of URLs like this:
http://dictionary.reference.com/browse/article+dashboard
http://www.securityfocus.com/archive/1/486323
http://whois.domaintools.com/wahm-articles.com
http://www.mail-archive.com//msg25805.html
http://profiles.friendster.com/2730188
I want the software to remove everything after the TLD (.com, etc.) and leave only the index page, so the list becomes:
http://dictionary.reference.com/
http://www.securityfocus.com
http://whois.domaintools.com
http://www.mail-archive.com
http://profiles.friendster.com
The software should also remove "www" from all the URLs, so the list becomes:
http://dictionary.reference.com/
http://securityfocus.com
http://whois.domaintools.com
http://mail-archive.com
http://profiles.friendster.com
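To make the two steps concrete, here is a minimal Python sketch of the transformation, not a finished tool. The function name `to_index_url` is hypothetical, and it assumes every input line is a full URL with a scheme (http:// or https://). It also assumes the result should have no trailing slash, as in most of the examples above.

```python
from urllib.parse import urlsplit


def to_index_url(url: str) -> str:
    """Reduce a URL to scheme + host and drop a leading 'www.'."""
    parts = urlsplit(url.strip())
    host = parts.netloc.lower()
    # Only strip "www." when it is the first label of the host name.
    if host.startswith("www."):
        host = host[len("www."):]
    # Path, query string and fragment are discarded on purpose.
    return f"{parts.scheme}://{host}"


if __name__ == "__main__":
    print(to_index_url("http://www.securityfocus.com/archive/1/486323"))
```

Subdomains other than www (dictionary, whois, profiles) are left untouched, which matches the sample output.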
It should be multithreaded because I will load in 100k URLs.
I will feed in the URLs from a text file.
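As a rough illustration of the file and threading requirements, the Python sketch below reads one URL per line and processes the list with a thread pool. All names here (`to_index_url`, `process_lines`, `process_file`, the worker count of 8) are assumptions made for the example, and it assumes each line is a full URL with a scheme. Since the work is plain string handling, threads in CPython are unlikely to make it much faster, and 100k URLs should finish quickly even on a single thread.

```python
from concurrent.futures import ThreadPoolExecutor
from urllib.parse import urlsplit


def to_index_url(url: str) -> str:
    """Hypothetical helper: keep scheme + host, drop a leading 'www.'."""
    parts = urlsplit(url.strip())
    host = parts.netloc.lower()
    if host.startswith("www."):
        host = host[len("www."):]
    return f"{parts.scheme}://{host}"


def process_lines(lines, workers: int = 8) -> list:
    """Clean every non-empty line; results keep the input order."""
    urls = [line.strip() for line in lines if line.strip()]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Executor.map returns results in the same order as the inputs.
        return list(pool.map(to_index_url, urls))


def process_file(in_path: str, out_path: str, workers: int = 8) -> None:
    """Read URLs from in_path (one per line) and write cleaned URLs to out_path."""
    with open(in_path, encoding="utf-8") as src:
        cleaned = process_lines(src, workers)
    with open(out_path, "w", encoding="utf-8") as dst:
        dst.write("\n".join(cleaned) + "\n")
```

A call such as `process_file("urls.txt", "cleaned.txt")` would write one cleaned URL per line; both file names are placeholders.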
I will choose the lowest bid.
Thanks
Skills required:
.NET,
C Programming,
PHP,
Python,
Visual Basic