Project Description:
I'd like to find someone who can have it finished for tomorrow.
Spider where it goes to google and yahoo, searches for what i tell it to, grabs all the results.. puts them in db.. then need to grab additional information about each domain/main page:
- Save each hardlinked domain into database to grab same information from (including keyword used to link to)
- Save and attach any "link URL" on the domain that has addlink.php or add_link.php or addurl.php or /linkex/ or /axslinks/ as a directory
- Save the keyword phrase used that found the search engine listing
- Mark when a domain has been 'spidered' so I can then run the script to do the hardlink domains that were found
Information to save for a domain:
- Age of Domain
- # of Outgoing Links & # of Internal Links on page
- # of Incoming Links (read by total Google/Yahoo/MSN no duplicates)
- Cache date of page
- Page Rank
It'd be nice to be able to display / sort the information with filters, such as must have a certain keyword and/or
a minimum Page Rank and/or maximum or minimum outgoing links and/or max/min internal links, and/or age of cache-date and/or age of domain and/or if there's a "link URL" found.. and then of course list the information displayed nicely.