Script - extracting information from HTMLs

This project was awarded to scripish for $60 USD.

Project Budget
$30 - $250 USD
Project Description


I need a script that will do the following:

Task 1:

1. I will insert a list of URLs (the list can contain 50,000 URLs or more)

2. I will insert a list of keywords (KWs), such as: casino, online casino, online poker, etc.

3. I will choose where I want the script to check for these KWs (meta title, meta KWs, meta description)

4. I will get the following outputs:

a. The total number of URLs in which the KWs were found

b. A list of all these URLs
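A minimal sketch of how Task 1 could work, assuming the pages have already been downloaded and that a keyword counts as "found" when it appears as a case-insensitive substring. The names MetaExtractor and matching_urls are hypothetical, chosen only for illustration; the posting does not specify any of them.

```python
from html.parser import HTMLParser


class MetaExtractor(HTMLParser):
    """Collects the <title> text and the keywords/description meta tags."""

    def __init__(self):
        super().__init__()
        self.fields = {"title": "", "keywords": "", "description": ""}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            attr = dict(attrs)
            name = (attr.get("name") or "").lower()
            if name in ("keywords", "description"):
                self.fields[name] = attr.get("content") or ""

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.fields["title"] += data


def matching_urls(pages, keywords, where):
    """Return the URLs whose chosen fields contain at least one keyword.

    pages:    dict mapping URL -> HTML source (assumed already fetched)
    keywords: list of keywords, matched as case-insensitive substrings
    where:    any of "title", "keywords", "description"
    """
    lowered = [kw.lower() for kw in keywords]
    hits = []
    for url, html in pages.items():
        parser = MetaExtractor()
        parser.feed(html)
        text = " ".join(parser.fields[field] for field in where).lower()
        if any(kw in text for kw in lowered):
            hits.append(url)
    return hits
```

The total for output (a) is then len(matching_urls(...)), and the returned list is output (b).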

Task 2:

1. From the list of URLs that I insert, I need a count of how many URLs I have for each domain extension: .com, .[url removed, login to view], .de and so on

a. I need absolute numbers (e.g. .com = 30, .de = 56)

b. I need lists of all URLs per .com, .de etc
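A rough sketch of Task 2, assuming the "extension" is simply the last dot-separated label of the host name. That assumption is naive: a two-part suffix would be grouped under its final label only, and handling it properly would need a public-suffix list. The function name group_by_tld is hypothetical.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def group_by_tld(urls):
    """Group URLs by the last label of their host name (e.g. ".com")."""
    groups = defaultdict(list)
    for url in urls:
        # urlsplit only fills in the host when a scheme or "//" is present.
        candidate = url if "//" in url else "//" + url
        host = (urlsplit(candidate).hostname or "").rstrip(".")
        if "." not in host:
            continue  # skip entries with no recognisable domain
        groups["." + host.rsplit(".", 1)[1]].append(url)
    return dict(groups)
```

The absolute numbers for (a) come from the length of each list, for example {tld: len(found) for tld, found in group_by_tld(urls).items()}, and the lists themselves are output (b).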

** The software should run on multiple threads
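One possible way to meet the multi-threading requirement is a thread pool that downloads the pages in parallel. This is only a sketch: fetch_html, fetch_all, the 10-second timeout and the pool size of 32 are illustrative assumptions, not values taken from the posting.

```python
from concurrent.futures import ThreadPoolExecutor
from urllib.request import urlopen


def fetch_html(url, timeout=10):
    """Download one page; this sketch treats any failure as an empty page."""
    try:
        with urlopen(url, timeout=timeout) as response:
            return response.read().decode("utf-8", errors="replace")
    except Exception:
        return ""


def fetch_all(urls, fetch=fetch_html, max_workers=32):
    """Fetch every URL in the list on a thread pool and return {url: html}."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map preserves input order, so zip pairs each URL with its page.
        return dict(zip(urls, pool.map(fetch, urls)))
```

Threads suit this job because the work is dominated by waiting on the network. For a list of 50,000 URLs it may be better to process the list in batches so that not every page is held in memory at once.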

** Provide examples of other tools that you've developed.

Thank you
