OABB - Extract data from PDF & webpage

IN PROGRESS
Bids
51
Avg Bid (USD)
$353
Project Budget (USD)
$250 - $750

Project Description:
URL: http://www.immunoleader.com

I. Save data from PDF and product page:
- using a list of catalog numbers.
- go to url and search each catalog number.
- open product page.
- open PDF link at the right side & download PDF.
- extract data from PDF.
- save data from each "bold" field in PDF into excel sheet.
- if PDF file has fields that are not already named in Excel, please add them into new column in Excel.
- excel sheet format as follow:

-------------------------------------------------------------------------------------------------------------------------------------------------------------
| catalog_number | product_name | lot_number| size | application | ... | image1 | image1_caption | image2 | image2_caption | ...
-------------------------------------------------------------------------------------------------------------------------------------------------------------
- Please use product name in large text in the product page for excel field "product_name".
- Please use data from "Applications&Reactivity" in the product page between "Code" and "Size" for excel field "application".
- Each bold field in PDF should have a column header in Excel sheet.

IMPORTANT:
+ Please add any new field in PDF to excel sheet if needed.
+ Please try to keep "superscript" text and symbols.
+ Each row in Excel will be for one catalog number.


II. Please save image:
- Save image as [catalog number]-image-1.jpg, etc...
- Put image name and image caption in excel.

- Image save example:
catalog_num: PA1003

image1:
PA1003-image-1.jpg

image1_caption:
Lane 1: Rat Testicular Tissue Lysate
Lane 2: Rat Brain Tissue Lysate
Lane 3: MCF7 Cell Lysate
Lane 4: MM453 Cell Lysate
Lane 5: SMMC Cell Lysate
Lane 6: Hela Cell Lysate
Lane 7: Colo320 Cell Lysate


- Please refer to sample excel template in attachment.
- I will send the list of catalog numbers when starting the project.
- There are 1329 catalog numbers.
- Please save all PDFs if possible.

Thank you for your time.

Additional Project Description:
12/14/2012 at 1:16 CET
NOTE: Please refer to attached sample excel and screen shots.

Skills required:
Data Entry, Excel, PDF
Additional Files: PA1003-image-1.jpg oabb_sample_template.xlsx screen_shot_3.png screen_shot_2.png screen_shot_1.png
Hire biodirector40
Project posted by:
biodirector40 United States
Verified
Public Clarification Board
Bids are hidden by the project creator. Log in as the employer to view bids or to bid on this project.
You will not be able to bid on this project if you are not qualified in one of the job categories. To see your qualifications click here.


$ 499
in 7 days
$ 700
in 20 days
$ 450
in 6 days
$ 475
in 9 days
$ 500
in 10 days
Hire rajeshsonisl
$ 1000
in 4 days
Hire mantislin
$ 319
in 6 days
$ 250
in 6 days
Hire Blurredge
$ 250
in 10 days
$ 250
in 7 days