ClinicalTrials.gov crawler

CANCELLED
Bids
31
Avg Bid (USD)
N/A
Project Budget (USD)
$250 - $750

Project Description:
I need a custom crawler that can accept a range of documents from http://www.clinicaltrials.gov/ct2/crawl...
example: http://www.clinicaltrials.gov/ct2/crawl/0 to http://www.clinicaltrials.gov/ct2/crawl/888
and return a csv file with these fields: Could also be mongodb, open for suggestions..

Clinical Study ID
Title
Phase
Study Status
Start Date
start Enroll
End Date
Primary Comp Date
Study Completion Date
Sponsor
Indication(s)
Intervention(s)
# of Sites
Enrollment (Actual #s where available)
List of Countries
Study Design
# of Study ARMs
Can be written in python or java or can be based on an opensource crawler like Nutch, Hetrix, Bixo web mining toolkit, Mechanize for Python, Crawler4j, etc

Skills required:
Java, NoSQL Couch & Mongo, Python, Web Scraping, XML
Hire jimwinburn
Project posted by:
jimwinburn
Verified
Public Clarification Board
Bids are hidden by the project creator. Log in as the employer to view bids or to bid on this project.
You will not be able to bid on this project if you are not qualified in one of the job categories. To see your qualifications click here.