We have legacy PC software that needs access to sports data for several sports popular in the US. Our existing system for compiling sports data and converting it into the format used by our legacy software is due for replacement, and that replacement is what this job posting is about.
We intend to create a new Sports Data Management System in two phases.
This project is about the first phase and will include an abstract class for scraping sports data from web sites, and an actual implementation of the abstract class for American football.
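To illustrate the kind of structure we have in mind, here is a minimal sketch of an abstract scraper with one football implementation. It is not taken from our specification document: the class names (SportsScraper, FootballScraper), the tag names (player_name, team_name, position), and the sample markup are all hypothetical, and the standard-library XML parser stands in for whichever XML-tag-based scraping library is eventually chosen.

```python
from abc import ABC, abstractmethod
import xml.etree.ElementTree as ET


class SportsScraper(ABC):
    """Hypothetical base class: turns page markup into tagged records."""

    @abstractmethod
    def parse(self, markup):
        """Return a list of dicts mapping tag names to scraped values."""

    def scrape(self, markup):
        """Parse the markup and drop records that have no player name."""
        return [rec for rec in self.parse(markup) if rec.get("player_name")]


class FootballScraper(SportsScraper):
    """Illustrative American football implementation (assumed markup)."""

    def parse(self, markup):
        root = ET.fromstring(markup)
        records = []
        for node in root.iter("player"):
            records.append({
                "player_name": node.findtext("name", default="").strip(),
                "team_name": node.findtext("team", default="").strip(),
                "position": node.findtext("position", default="").strip(),
            })
        return records


# Made-up sample input; a real page would be fetched from a web site.
SAMPLE = (
    "<roster>"
    "<player><name>John Smith</name><team>Example City Hawks</team>"
    "<position>QB</position></player>"
    "<player><name>Al Jones</name><team>Example City Hawks</team>"
    "<position>WR</position></player>"
    "</roster>"
)

records = FootballScraper().scrape(SAMPLE)
```

Other sports would then only need their own subclass with a different parse method, while shared behaviour stays in the base class.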
Once the data has been scraped, additional processing will be required:
First, players and teams will need to be looked up in an existing legacy database in order to obtain their legacy ID numbers. Heuristic searches based on simple pattern recognition will be required, as the names may not always match exactly. Exception processing is required if a player or team cannot be matched, and this will be detailed in our specification document.
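As a rough sketch of what such a heuristic lookup could look like, the example below normalizes names and then falls back to approximate matching with the standard-library difflib module. The in-memory dictionary, the UnmatchedNameError exception, the function names, and the 0.8 similarity cutoff are assumptions made for illustration; the real legacy database access and exception handling are defined in the specification document, not here.

```python
import difflib

# Hypothetical stand-in for the legacy database: name -> legacy ID.
LEGACY_PLAYERS = {
    "John Smith": 1001,
    "Alan Jones": 1002,
    "Robert Brown Jr.": 1003,
}


class UnmatchedNameError(LookupError):
    """Raised when no legacy record is close enough to the scraped name."""


def normalize(name):
    """Lowercase, replace periods and commas, and collapse whitespace."""
    cleaned = name.lower().replace(".", " ").replace(",", " ")
    return " ".join(cleaned.split())


def find_legacy_id(scraped_name, legacy=LEGACY_PLAYERS, cutoff=0.8):
    """Return the legacy ID for a scraped name or raise UnmatchedNameError.

    An exact match on the normalized name is tried first; otherwise the
    single closest normalized name at or above the cutoff is accepted.
    """
    by_normalized = {normalize(name): lid for name, lid in legacy.items()}
    key = normalize(scraped_name)
    if key in by_normalized:
        return by_normalized[key]
    close = difflib.get_close_matches(
        key, list(by_normalized), n=1, cutoff=cutoff
    )
    if not close:
        # Hand over to whatever exception processing the spec requires.
        raise UnmatchedNameError(scraped_name)
    return by_normalized[close[0]]


exact_id = find_legacy_id("Robert Brown Jr")   # differs only by punctuation
fuzzy_id = find_legacy_id("Jon Smith")         # one letter missing
```

A higher cutoff reduces false matches at the cost of sending more names to exception processing, so the value would need tuning against real data.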
Once the data has been scraped and the legacy ID number has been found (or exception processing has taken place), a set of CSV files will be generated based on templates provided in a configuration file for the system. Each template is a straightforward sequence of the tags used by the scraper, which specifies the contents to be included in the corresponding CSV file and the order in which they are to be placed.
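The template-driven output could work along the following lines. This is only a sketch under assumptions: the comma-separated template string, the tag names, and the helper names (parse_template, write_csv) are hypothetical, and the actual configuration file format will be given in the specification document.

```python
import csv
import io

# Hypothetical template line as it might appear in the configuration file:
# a sequence of scraper tags, in the order the CSV columns should appear.
TEMPLATE_LINE = "legacy_id, player_name, position"


def parse_template(line):
    """Split a comma-separated template line into an ordered tag list."""
    return [tag.strip() for tag in line.split(",") if tag.strip()]


def write_csv(records, tags, stream):
    """Write one row per record, keeping only the template's tags.

    Tags missing from a record are written as empty fields, and record
    keys that the template does not mention are ignored.
    """
    writer = csv.DictWriter(
        stream, fieldnames=tags, restval="", extrasaction="ignore"
    )
    writer.writeheader()
    writer.writerows(records)


# Made-up records, as they might look after the legacy ID lookup.
records = [
    {"player_name": "John Smith", "team_name": "Example City Hawks",
     "position": "QB", "legacy_id": 1001},
    {"player_name": "Al Jones", "legacy_id": 1002},
]

buffer = io.StringIO()
write_csv(records, parse_template(TEMPLATE_LINE), buffer)
csv_text = buffer.getvalue()
```

Writing to a stream rather than directly to a path keeps the function easy to test; a real run would pass an open file instead of the in-memory buffer.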
The above concludes what we need for Phase 1 of the project. The second phase will be listed as a new project on oDesk and will be preferentially offered to whoever completes Phase 1 to our satisfaction. It will consist of building a Django-based GUI around the system constructed in Phase 1, with a specific set of features which we will detail before the project begins.
We require the system to be developed in Python. With regard to the scraper, our only requirement is that it use a scraping library based on XML tags. The Python code should follow the PEP 8 Style Guide and should be self-documenting using Python docstrings.
Please note the deadline for this project. We have schedules imposed on us by external factors and you should not expect that we will be able to extend the schedule any further, so please do not apply if you aren't confident you can meet the schedule.
We require applicants to be fluent in English and available to have regular meetings with us via Skype (audio).
Given our tight schedule, we are looking for a full-time developer with plenty of relevant professional experience for this job. We reward hard-working and productive individuals with quick payments, positive feedback, and exclusive offers for future work. We do not tolerate poor performance, missed deadlines, or excuses designed to justify the same.
Please ignore the budget for this project and bid whatever you would like to be paid for it.