I need to extract information from files in Marathi (an Indian language) using OCR with Tesseract.
The problem is well defined, and sample PDFs are available at these links:
1. [login to view URL]
2. [login to view URL]
The sample file is attached herewith.
If you can write code that extracts the information from this file in the right format, we can go ahead and discuss this further.
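
As a rough starting point, the kind of script being asked for could look like the sketch below. It is only an illustration under several assumptions: that the `pytesseract` and `pdf2image` packages are installed, that Tesseract has the Marathi language data (`mar`) available, and that the PDFs are scanned images rather than text PDFs. The function names `clean_ocr_text` and `ocr_pdf_marathi` are hypothetical, and the cleanup step (dropping blank lines, collapsing spaces, converting Devanagari digits to ASCII) is a guess at what "right format" might involve, since the target format is not specified here.

```python
import unicodedata

# Map Devanagari digits to ASCII digits (an assumed convenience, not a requirement).
DEVANAGARI_DIGITS = str.maketrans("०१२३४५६७८९", "0123456789")


def clean_ocr_text(raw: str) -> list[str]:
    """Normalize raw OCR output into a list of non-empty, tidy lines."""
    text = unicodedata.normalize("NFC", raw)
    text = text.translate(DEVANAGARI_DIGITS)
    # Collapse runs of whitespace inside each line and trim the ends.
    lines = [" ".join(line.split()) for line in text.splitlines()]
    return [line for line in lines if line]


def ocr_pdf_marathi(pdf_path: str, dpi: int = 300) -> list[str]:
    """Render each PDF page to an image and OCR it with the Marathi model."""
    # Imported here so the cleanup helper works without the OCR stack installed.
    from pdf2image import convert_from_path
    import pytesseract

    lines: list[str] = []
    for page in convert_from_path(pdf_path, dpi=dpi):
        raw = pytesseract.image_to_string(page, lang="mar")
        lines.extend(clean_ocr_text(raw))
    return lines
```

Calling `ocr_pdf_marathi("sample.pdf")` (with a placeholder file name) would return the recognized text line by line. Mapping those lines to specific fields would still depend on the layout of the sample file.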