Project Description:
I would like to scrape the data from the following websites every 5 minutes. Some have an XML feed and some just have live lines on their site without having to login.
I need the event name, date, time the game starts, rotation number, what teams are involved (and which is home/away), type of bet, odds. Some sites also have alternative markets such as team +2.5, but not every book does. I'd still like to capture those for the sites that do. Also, I'd like the 1st half/2nd quarter etc lines if available too. I'm a little familiar with XML and could work with that, but it doesn't seem like all the sites have a XML feed and I don't know how to scrape the data from those places.
Also, I'd like for it to check if the line changed. If it did add the new entry to the database with the time it checked, if not then don't bother to add it.
The sites I want to get data from are:
Pinnaclesports.com / 5Dimes.com / WSEX.com / Matchbook.com / TheGreek.com / sportsinteraction.com / bookmaker.com