PHP or Ruby discussion board scrapers

IN PROGRESS
Bids
8
Avg Bid (USD)
N/A
Project Budget (USD)
$100 - $1200

Project Description:
We're looking for a Ruby or PHP app that will accept the name or id of a message/discussion board on the following sites and download its content into a MySQL database:

1. abc.com
2. hulu.com
3. any Simple Machines Forum

Both sites 1 & 2 use extensive Javascript to generate views of their boards. You must be willing to license your code under GPL or a similar open license. Your code must not adversely affect the host site's performance. We are willing to break the project into milestones or individual projects for each site.

The basic structure of the DB will be (your comments and suggestions welcome):

Sites: id, name, front_page_url, other_metadata
Users: id, name, join_date. running_post_total, other_metadata, site_id
Topics: id, subject, date_opened, running_post_total, other_metadata, site_id
Messages: id, content, date_posted, other_metadata, user_id, parent_message_id, topic_id, site_id

Summary of relationships -
users:sites 1:1 (may turn out to be 1:many, for now assume 1:1),
topics:sites 1:1,
messages:sites 1:1,
messages:messages 1:many (replies),
users:topics 1:many,
users:messages 1:many,
topics:messages 1:many.

May not need to store site_id everywhere given those relationships, but DB should allow for fast querying of topics per site, for instance.

Skills required:
Javascript, MySQL, PHP, Ruby on Rails
About the employer:
Verified
Public Clarification Board
Bids are hidden by the project creator. Log in as the employer to view bids or to bid on this project.
You will not be able to bid on this project if you are not qualified in one of the job categories. To see your qualifications click here.