Please see the attached PDF files.
I need a program, with complete source code, that reads PDF files and writes the extracted text to text files.
The text in the text files must appear exactly as it appears in the PDF files.
Please see the attached 'More Information to PDF Reader [url removed, login to view]' file. It explains in more detail how the PDF program has to run.
The PDFBox and iText libraries can read PDF files. You may also use any other language or library you are comfortable with.
Information about the PDF files:
1) They contain characters in English and Kannada.
2) Kannada is an Indian vernacular language.
3) The Kannada characters fall in the Unicode block U+0C80 to U+0CFF.
4) The PDF files are Identity-H encoded.
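To illustrate what Identity-H encoding means for text extraction, here is a minimal Python sketch. It assumes the fonts in the PDF carry a ToUnicode CMap, which is what extraction libraries normally rely on for Identity-H fonts, because the 2-byte codes in the content stream are glyph identifiers rather than Unicode values. The function names (`is_kannada`, `parse_bfchar`, `decode_identity_h`) are hypothetical, the sketch reads only `bfchar` entries and ignores `bfrange`, and it takes the Kannada block to be U+0C80 to U+0CFF as defined by the Unicode Standard. It is not a replacement for PDFBox or iText.

```python
import re

# Kannada block as defined by the Unicode Standard.
KANNADA_START, KANNADA_END = 0x0C80, 0x0CFF


def is_kannada(ch):
    """Return True if the single character ch lies in the Kannada block."""
    return KANNADA_START <= ord(ch) <= KANNADA_END


def parse_bfchar(cmap_text):
    """Build {character code: Unicode string} from the bfchar sections
    of a ToUnicode CMap. bfrange sections are ignored in this sketch."""
    mapping = {}
    for section in re.findall(r"beginbfchar(.*?)endbfchar", cmap_text, re.S):
        pairs = re.findall(r"<([0-9A-Fa-f]+)>\s*<([0-9A-Fa-f]+)>", section)
        for src, dst in pairs:
            # Destination strings in a ToUnicode CMap are UTF-16BE.
            mapping[int(src, 16)] = bytes.fromhex(dst).decode("utf-16-be")
    return mapping


def decode_identity_h(raw, to_unicode):
    """Decode a byte string of 2-byte Identity-H codes using the mapping.
    Codes without a mapping become U+FFFD; a trailing odd byte is dropped."""
    out = []
    for i in range(0, len(raw) - 1, 2):
        code = int.from_bytes(raw[i:i + 2], "big")
        out.append(to_unicode.get(code, "\ufffd"))
    return "".join(out)
```

For example, a made-up CMap fragment such as `<0003> <0041>` and `<0010> <0C95>` would let `decode_identity_h(b"\x00\x10\x00\x03", mapping)` return the Kannada letter KA followed by `A`. A PDF stores glyphs in drawing order, so the decoded text may still need reordering before it reads correctly.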
You can also look up the Unicode code points for Kannada characters at the following link:
[url removed, login to view]
Open the drop-down list and select Kannada.
Please send or post sample output so that I can check whether the program extracts the text in the correct order.
For example: from a 30-page file, extract the information from page 1 and page 3 and organize it in the correct order, as described in the 'More information to pdf reader [url removed, login to view]' file. I will let you know whether the order is correct or needs any modification. I will then award you the project.
Please let me know if you have any questions.
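The sample-output step could be sketched as below. This is only an illustration: it assumes the per-page text has already been extracted by some library into a list of strings (`page_texts`), and the function name `write_selected_pages` and the `page_NNN.txt` file naming are hypothetical. Writing with UTF-8 keeps the Kannada characters intact in the text files.

```python
from pathlib import Path


def write_selected_pages(page_texts, page_numbers, out_dir):
    """Write each requested page (1-based) to its own UTF-8 text file.

    page_texts   -- list of strings, one per page, in document order
    page_numbers -- 1-based page numbers to export, e.g. [1, 3]
    out_dir      -- directory for the output files (created if missing)
    Returns the list of paths written, in the order requested.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    for n in page_numbers:
        if not 1 <= n <= len(page_texts):
            raise ValueError(f"page {n} is outside 1..{len(page_texts)}")
        path = out / f"page_{n:03d}.txt"
        # UTF-8 can represent both the English and the Kannada text.
        path.write_text(page_texts[n - 1], encoding="utf-8")
        written.append(path)
    return written
```

Calling `write_selected_pages(page_texts, [1, 3], "sample_output")` on a 30-page document would produce `page_001.txt` and `page_003.txt` for review.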