Im looking for a PHP Regex Programmer to scrape all the links from this website:
[url removed, login to view]
that are for each episode and Season...
once its collected the links it needs to follow the link to that page,
eg: [url removed, login to view]
it then has to follow the link "continue".
EG: [url removed, login to view]
it then needs grab the entire URL of the webpage.. and insert it into the database along with variables such as what season and epsiode it is.
I have had several messaged with people showing links they have already scraped, however the links are not correct, the script needs to follow the continue buttons URL where it will then automatically redirect to the correct page.. which is the real URL that needs scraping.
also, although the example that has be brought forward have been impressive, the script would need me to continually type in the new URL for each Show..
so if it would be possible, i would prefer if the script could start on this page: http://tv.blinkx.com/
it would then need to firstly scrape each page for the list eg: http://tv.blinkx.com/?more=_num#shows OR http://tv.blinkx.com/?more=a#shows
then it would need to open each Show and scrape all the links for each season and episode correctly.
PS. please dont forget the script needs to remember which show, season and epsiode.
also, it would be very cool if on the first page, the description of the show could be scraped, as this could also be placed into my database.