In Progress

PDF trawler

The Big Challenge - the ability to trawl through a PDF, extract bookmarked text and images, and place these in a logical table based on bookmark titles and page numbers. Thus a 20-page document might have pages 1-5 on "Breeding cows", pages 6-17 on "Breeding ducks" and pages 18-20 on "Conclusion". These 'sections' need to be variable and controlled by the user.
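One way to picture the section logic is the minimal Python sketch below. It assumes the bookmarks have already been read from the PDF by some library and reduced to (title, start page) pairs; the names `Section` and `sections_from_bookmarks` are hypothetical and not part of the brief. It only derives the page ranges and does not extract any text or images.

```python
from dataclasses import dataclass


@dataclass
class Section:
    title: str
    first_page: int  # 1-based, inclusive
    last_page: int   # 1-based, inclusive


def sections_from_bookmarks(bookmarks, page_count):
    """Turn (title, start_page) bookmarks into contiguous page ranges.

    Each section runs from its own start page to the page before the
    next bookmark; the last section runs to the end of the document.
    """
    ordered = sorted(bookmarks, key=lambda b: b[1])
    sections = []
    for i, (title, start) in enumerate(ordered):
        if i + 1 < len(ordered):
            # Assumption: two bookmarks on the same page each get that one page.
            end = max(start, ordered[i + 1][1] - 1)
        else:
            end = page_count
        sections.append(Section(title, start, end))
    return sections


# Hypothetical bookmarks matching the 20-page example in the brief.
example = [("Breeding cows", 1), ("Breeding ducks", 6), ("Conclusion", 18)]
for s in sections_from_bookmarks(example, page_count=20):
    print(f"{s.title}: pages {s.first_page}-{s.last_page}")
```

Because the ranges are computed from whatever bookmarks the user has placed, moving or adding a bookmark changes the sections without any change to the code.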

The final table of data will have column headings:

Section

Title of article

Author

Article

Picture

Picture caption
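As an illustration of the output, the sketch below writes rows under these six headings as CSV using Python's standard `csv` module. The choice of CSV, the function name `rows_to_csv` and the sample values are assumptions; the brief does not say which table format is wanted.

```python
import csv
import io

# Column headings taken from the brief.
COLUMNS = [
    "Section",
    "Title of article",
    "Author",
    "Article",
    "Picture",
    "Picture caption",
]


def rows_to_csv(rows):
    """Render a list of dicts as CSV text; missing fields become empty cells."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=COLUMNS, restval="")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical row: the picture is stored as a file name, not as image data.
sample = [
    {
        "Section": "Breeding cows",
        "Title of article": "Example title",
        "Author": "Example author",
        "Article": "Example article text",
        "Picture": "cows_page_2.png",
        "Picture caption": "Example caption",
    }
]
print(rows_to_csv(sample))
```

Storing a file name in the Picture column is one possible design; the extracted images themselves would be saved alongside the table.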

The budget for the project includes an additional bonus award for a macro that makes the manual process of bookmarking the PDF easier. For example, it automatically recognises the page and, once a selection has been made, a right click of the mouse gives a drop-down of checkboxes: is this selection a title, author, article, picture or caption?

Please think carefully about the project and avoid asking unnecessary questions. Bidders who show a clear understanding of the project from the beginning and who, instead of asking, offer smart solutions or plans for how they will go about the job are most likely to be successful. Previous experience in data extraction is an advantage.

Thank you for your interest in this project.

John

Skills: Data Processing, Python, Ruby on Rails, Script Install, Translation


About the Employer:
( 11 reviews ) Colombo, Sri Lanka

Project ID: #230223

Awarded to:

kamosion

Please see PMB.

$1500 USD in 13 days
(4 Reviews)
2.8

8 freelancers are bidding on average $2125 for this job

best1

Kindly check PMB

$3000 USD in 35 days
(13 Reviews)
6.8
justinsylas

sir, view pm

$3000 USD in 40 days
(7 Reviews)
5.0
riyoyo

As discussed. Please, send me the sample of PDF document for my review. It seems these tags (Section, Title of article, Author, Article, Picture, and Picture caption) are not PDF default tags. So, we should review your ...

$2700 USD in 5 days
(2 Reviews)
2.4
All1Source

Pls see PMB for details.

$1800 USD in 10 days
(0 Reviews)
5.8
imartin83

There is a possibility to automate process with Java. I've already done few similar projects so please send me an example of your PDF so we can discuss your needs vs my current code in order to comply. Please check PM ...

$2000 USD in 7 days
(0 Reviews)
0.0
popmax3

I CAN DO IT, JUST SEND ME DETAILS WITH DEMO PDF PLEASE ..

$1500 USD in 5 days
(0 Reviews)
0.0
SharadaJoshi

Could you please send me the details to have a better understanding of what is desirable.

$1500 USD in 15 days
(0 Reviews)
0.0