I require a java command line program to scan a PDF and then add 5 bookmarks to the PDF.
The 5 bookmarks that I want to add are xIntroduction, xMethods, xResults, xDiscussion, xBibliography. These bookmarks should correspond to the headings in the PDF heading of the same name(s).
The program will need to address the following:
a) Multiple headings for each possible bookmark. For example “Introduction” in the PDF may be “Introduction” or “Background”. I will provide a list of possible alternate headings.
b) Note: Some of the PDFs may already have BookMarks. Some PDFs will have no bookmarks.
c) Many of the PDFs have an abstract section that may include the same headings (i.e. introduction, methods, results, discussion). The bookmarks should not be part of the abstract section.
d) Most likely the headings are in a larger font and separated from additional text.
I will send a zip file of 20 PDFs that you can use for testing. I will then test the program on 10 PDFs.
If it helps, I work with pdfbox 1.8.2.
Deliverables for this project is a java command line program and documented code.
References:
[login to view URL]
I've worked with Java and PDF's before, mostly using iText, Apache FOP (XSL:FO) and Flying Saucer (converts XHTML to PDF).
Most of my work with these has been more focused on the generation of documents from within programs instead of updating existing documents. They were mainly either generating PDF reports or (in one case) making a requirement document from a series of letter templates. I've also written quite a few little command line applications over the years to do parts of this as well (if that helps).
5 Days may be a bit much for this, but you noted that you were using PDF Box, so I added extra time in there to get up to speed with this before I start the project. I don't want to deliver something to you that's using different libraries to what you are currently using.
If I have some extra time at the end, I can also put a Swing interface on this if you want.
As with everything I do it will:
Work on delivery
Have full javadoc
Validate the files, inputs and return meaningful error messages.
Be in a Eclipse project (so you can browse / run unit tests easily). If you are using a different open-source IDE (E.g. Netbeans) I can probably convert it to that before delivery if you want.
Not too sure what else I can say on this one. I'm a java dev with around 8 years under my belt and 11 big projects and numerous small utility programs to make those projects easier. Happy if you want to ask me anything else or want a copy of my resume.
$133 USD in 5 days
5.0 (2 reviews)
2.9
2.9
14 freelancers are bidding on average $260 USD for this job
Dear Sir,
I'd like to help you to complete this task successfully. I'm an experienced Software Engineer with solid knowledge of Java development. I have over 10 years of experience in the field and I'm Oracle Java certified with 100% score.
I have more experience working with iText, is a requirement to use pdfbox?
If your are interested in going further I'd like to check the PDFs to make some basic tests.
If you have any doubt, don't hesitate to ask me whatever you need to know. Thank you!
Greetings,
Francisco Fernandez Cabrera
Software Engineer
Can help... I am an Expert...
Please start a Discussion with me and we can get started from there...
Please check the past projects I have handled and check my reviews for what employers have to say about my work... Can start right now...
Hi can you explain more about a) "Multiple headings for each possible bookmark."? do you mean there are possible multiple bookmark positions for each bookmark? Regards Michael
Hello healthsim,
I'm Asem Athwari, Java teacher.
I'm really interested on your project and ready to start immediately. I'm currently working on Digital Library project So, I have an experience with PDF tools. If you have any questions, Feel free to contact me.
Best regards, Clinux.
I know how to use pdfbox for PDF extraction because I've built my own extractor on top of pdfbox. Also I'm familiar with PDF format and document structure, I mitigated many, many problems due to font [or rather text] representation in PDFs.
With over 10 years experience in developing scalable applications in Java/J2EE and having worked with creating command line Java apps and executable jar files, I believe I am the right candidate for this project. I also am very focused on quality and detail-oriented. Please feel free to contact me to go over additional details