Closed

Ocr scanned pdfs to text app

I need a web based app that will accurately OCR scanned pdfs. The uploading of the pdfs will be done through a web interface. The app will poll a directory of pdfs and any new pdfs that gets dropped into the directory will immediately be OCR'ed into a text file. The OCR engine must be able to deal with scans and images.

Once the file is OCR'ed, the app must find a dynamic list of user supplied regex expressions and output the results into a csv file for each pdf.

The polling can be a cron job or daemon, I don't care, but you need to instruct me on how to set it up.

The app can be done in php or rails (preferably rails).

The web interface must use bootstrap or [url removed, login to view]

Before I award you the project, I want to see a sample app that can OCR the scanned pdf. Once I am satisfied that your solution can output decent text that matches the scans, I will award you the project 50% down and 50% upon completion and code transfer.

Skills: Bootstrap, MySQL, PDF, PHP, Ruby on Rails

See more: ruby on rails ocr, regex matches, rails job, find ruby on rails, find job ruby on rails, ed don, app engine cron, ruby on rails bootstrap, rails any, find a code app, bootstrap ruby on rails, ruby rails job, ruby job, ocr text, ocr app, images to text, how to award, deal app, set rails, web app rails, rails web app, php ocr solution, php code ocr, bootstrap ruby, text based interface

About the Employer:
( 30 reviews ) Houston, United States

Project ID: #4946965

6 freelancers are bidding on average $435 for this job

ranganathp

Can help... I am an Expert... Please check the past projects I have handled and check my reviews for what employers have to say about my work...

$1000 USD in 21 days
(27 Reviews)
6.3
codeware1

Hello, Hope you are well and doing well!!! We are eager to work with you. We are ready to start the project right away. CODEWARE is a web & Mobile application development company providing professional desi More

$309 USD in 12 days
(31 Reviews)
5.8
drudev

Hello. I am a web developer with 11+-year experience . I work remotely on DigitalRay company - [url removed, login to view] - LA (USA). Technical knowledges: 1PHP JavaScript Ruby RoR ASP.Net C#, Zend, CakePHP, Symphony frameworks More

$333 USD in 3 days
(14 Reviews)
4.2
sandeepsrm23

Hi , I want to handle this job for you , Please see you PMB.

$311 USD in 7 days
(1 Review)
4.0
DavidLiu80

Invited, Hi, I developed before the similar app which does ocr a pdf file and extract specified texts. At that time, I developed in C# using Tesseract engine. Now I can easily it convert to your purpose. Please contact More

$333 USD in 5 days
(2 Reviews)
2.6
dandavis68

I wrote the code used by eDiscovery appliance software firm Index Engines, Inc. to OCR PDF using Nuance's OCR Linux library. I also wrote the first version of their query server, php admin server, php query interface, More

$333 USD in 3 days
(0 Reviews)
0.0
Fu22yLogic

invited ... check PM...

$300 USD in 14 days
(0 Reviews)
0.0