PDF Batch Splitting and OCR using Ghostscript
- Status: Closed
- Prize: $30
- Entries Received: 4
- Winner: MaliVikram
Contest Brief
We are looking for someone to do some PDF manipulation using Ghostscript. Final result must work on Windows XP (32 & 64-bit) and higher Windows operating systems.
Design a program that continuously watches a specified folder. When new PDF files appear, it should automatically split and OCR them, move the single-page OCR'd files to a “Done” folder, and then delete the original multi-page PDF.
This program must run continuously. The only time we should need to double-click to start it is after rebooting the computer.
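One way to picture the "continuously watching" requirement is a simple polling loop. The sketch below is only an illustration, not part of the brief: the names `stable_pdfs` and `watch`, the 5-second interval, and the rule "process a file only once its size has stopped changing between two polls" are all assumptions, added so that a PDF that is still being copied into the folder is not picked up half-written.

```python
import time
from pathlib import Path


def stable_pdfs(folder, last_sizes):
    """Return PDFs in `folder` whose size is unchanged since the previous poll.

    `last_sizes` maps file name -> size seen on the previous poll and is
    updated in place, so a file that is still growing is skipped for now.
    """
    ready = []
    current = {}
    for path in sorted(Path(folder).iterdir()):
        if path.is_file() and path.suffix.lower() == ".pdf":
            size = path.stat().st_size
            current[path.name] = size
            if last_sizes.get(path.name) == size:
                ready.append(path)
    last_sizes.clear()
    last_sizes.update(current)
    return ready


def watch(folder, handle, interval_seconds=5.0, max_polls=None):
    """Poll `folder` and call `handle(path)` for every PDF that looks complete.

    `handle` is expected to move or delete the file when it is done;
    otherwise the same file is handed over again on the next poll.
    `max_polls=None` keeps polling until the process is stopped.
    """
    last_sizes = {}
    polls = 0
    while max_polls is None or polls < max_polls:
        for pdf in stable_pdfs(folder, last_sizes):
            handle(pdf)
        polls += 1
        time.sleep(interval_seconds)
```

Calling `watch("1_Specs2Split", process_pdf)` with a hypothetical `process_pdf` function would then run until the window is closed, which matches the "start once after reboot" requirement.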
Below is one suggestion I have, but feel free to modify it as long as the end result is the same:
1. Using the following folders:
1_Specs2Split
2_Specs2OCR
3_Specs2Transfer
2. Automatically look in “1_Specs2Split” folder for PDF files.
3. Split those PDF files into single pages and place them in “2_Specs2OCR”. Name them like this for each sheet:
[original name]-001.pdf
[original name]-002.pdf
[original name]-003.pdf
[original name]-004.pdf
4. Once splitting is done, delete the file from “1_Specs2Split” folder.
5. OCR the files in “2_Specs2OCR” and place them in “3_Specs2Transfer”.
6. Once OCR is done, delete the file from “2_Specs2OCR”.
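The splitting and naming in steps 3 and 4 could be sketched roughly as follows. This assumes Ghostscript's `pdfwrite` device writes one file per page when the output file name contains a `%03d` placeholder, which is worth confirming against the installed Ghostscript version. The executable name `gswin32c` (or `gswin64c` on 64-bit installs) is assumed to be on the PATH, and the helper names `page_name`, `split_command`, and `split_and_remove` are made up for this illustration. The OCR step is left out because it depends on which OCR tool is chosen.

```python
import subprocess
from pathlib import Path

# Assumption: the Ghostscript console executable is on PATH under this name.
GS_EXE = "gswin32c"


def page_name(original_stem, page_number):
    """Name of a single-page file, e.g. ('plan', 3) -> 'plan-003.pdf'."""
    return "%s-%03d.pdf" % (original_stem, page_number)


def split_command(src, out_dir, gs_exe=GS_EXE):
    """Build a Ghostscript command that writes one PDF per page of `src`."""
    src = Path(src)
    # Ghostscript fills in %03d with the page number (001, 002, ...).
    pattern = Path(out_dir) / (src.stem + "-%03d.pdf")
    return [
        gs_exe,
        "-q", "-dBATCH", "-dNOPAUSE",
        "-sDEVICE=pdfwrite",
        "-sOutputFile=" + str(pattern),
        str(src),
    ]


def split_and_remove(src, out_dir, gs_exe=GS_EXE):
    """Split `src` into `out_dir`, then delete the original (steps 3 and 4)."""
    # check_call raises if Ghostscript fails, so the original is kept on error.
    subprocess.check_call(split_command(src, out_dir, gs_exe))
    Path(src).unlink()
```

Passing the command as a list rather than a single string avoids the `%` escaping problems that come up when the same command is written in a Windows batch file.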
Deadline for this project is 3/25/2014 9am PST. Whoever finishes this contest first will be awarded the project.
I will be online on Freelancer at 9:30am PST to answer questions or to review your work.
Employer Feedback
“Vikram did a great job. Highly recommended. We just hired him for a new project, and he will be our first choice for future projects.”
montauk, United States.