Scrape Simple Website with Screen-Scraper Software
(1) Experience with Screen Scraper
(2) Extensive knowledge with JAVA
(3) Must agree with standard Non-Disclosure/confidentiality reply
(1) Goto Index Page
One long index page list all the links to the details page, by date.
(2) Goto Details Page, by specific date
(3) Scrape the following
One Entry - Scrape approx 6 fields, which have only one entry
Multiple Entries - Scrape 1 field which contains multiple entries, seperated by pipe
Multiple Entries - Scrape 1 field again which contains multiple entries, seperated by pipe
(3) Write to TSV
(4) DL corresponding images
(1) Sometimes one important field isn't posted yet, skip the record.
(2) The two fields with multiple entries must match, i.e if we were using CSV it would look like this
...,line1-aaa | line2-bbb,line1-19.95 | line2-0.00,...
if there is no matching entry then write NULL, for example
...,line1-aaa | line2-bbb,line1-19.95 | NULL,...
(3) Since we will be running scrape looking for new information every 4 hours a day, don't write duplicates.
Project Due Date:
4-5 days after bid acceptance
Look forward to Bids