Write a PDF Organising Application

This project received 15 bids from talented freelancers with an average bid price of $174 AUD.

Project Budget
$30 - $250 AUD
Project Description

I want a PDF sorting program written in Java that will:

1. Read PDF files from a configured source directory. (SOURCE_DIRECTORY)

2. Perform OCR on the PDF to convert it into a PDF with embedded text.

3. Rename & move the PDF to a new location based on configured text matching rules.

An example rule might be:

  • The PDF contains text that matches '<BANK_NAME>', where BANK_NAME = 'National Australia Bank'.
  • The PDF contains text that matches 'Statement Date: <STATEMENT_DATE>', where STATEMENT_DATE is a formatted date.
  • The PDF contains text that matches 'Account number: <ACC_NUMBER>', where ACC_NUMBER is any string between 5 and 12 characters long.
  • When all of the above match, place the PDF into c:/banking/<BANK_NAME>/<ACC_NUMBER>_<STATEMENT_DATE>.pdf.

I would think that implementing the rule matching using regular expressions would work well, but I'm open to other ideas.
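To illustrate the regular-expression idea, here is a minimal sketch of the example rule using java.util.regex named groups. It assumes the OCR text is already available as a String. The class name, the group names, the dd/MM/yyyy date format, and the reading of "any string" as "any run of non-whitespace characters" are all assumptions made for illustration. The date is rewritten as yyyy-MM-dd because slashes cannot appear in a file name.

```java
import java.time.LocalDate;
import java.time.format.DateTimeFormatter;
import java.time.format.DateTimeParseException;
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class RuleMatchSketch {
    // Hypothetical patterns for the example rule. Java group names may only
    // contain letters and digits, so BANK_NAME becomes "bank", and so on.
    static final Pattern BANK = Pattern.compile("(?<bank>National Australia Bank)");
    // Assumption: the statement date is printed as dd/MM/yyyy.
    static final Pattern DATE =
            Pattern.compile("Statement Date:\\s*(?<date>\\d{2}/\\d{2}/\\d{4})");
    // Assumption: the account number is 5 to 12 non-whitespace characters.
    static final Pattern ACCOUNT =
            Pattern.compile("Account number:\\s*(?<acc>\\S{5,12})(?!\\S)");
    static final DateTimeFormatter INPUT_DATE = DateTimeFormatter.ofPattern("dd/MM/yyyy");

    /** Returns the target path if every condition of the rule matches, else empty. */
    static Optional<String> targetFor(String pdfText) {
        Matcher bank = BANK.matcher(pdfText);
        Matcher date = DATE.matcher(pdfText);
        Matcher acc = ACCOUNT.matcher(pdfText);
        if (!bank.find() || !date.find() || !acc.find()) {
            return Optional.empty();
        }
        LocalDate statementDate;
        try {
            statementDate = LocalDate.parse(date.group("date"), INPUT_DATE);
        } catch (DateTimeParseException e) {
            return Optional.empty(); // looked like a date but is not a valid one
        }
        // LocalDate.toString() yields ISO yyyy-MM-dd, which is safe in file names.
        return Optional.of("c:/banking/" + bank.group("bank") + "/"
                + acc.group("acc") + "_" + statementDate + ".pdf");
    }

    public static void main(String[] args) {
        String text = "National Australia Bank\n"
                + "Statement Date: 05/03/2014\n"
                + "Account number: 12345678\n";
        System.out.println(targetFor(text).orElse("no rule matched"));
    }
}
```

Keeping each condition as its own pattern means a rule is just a list of patterns plus a target template, which should map naturally onto a configuration file.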

The OCR would need to be done by something freely available. Perhaps [url removed, login to view]

A GUI for managing the rules would be nice to have, but initially I'd be happy to edit a configuration file manually.
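One possible shape for a hand-edited configuration file is a plain java.util.Properties file. The sketch below is only a suggestion: every key name (source.directory, poll.interval.seconds, rule.nab.*), the c:/scans directory, and the {name} placeholder syntax in the target are hypothetical. The regular expressions avoid backslashes because a backslash is an escape character in the properties format.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;
import java.util.Properties;
import java.util.regex.Pattern;

class RuleConfigSketch {
    // Hypothetical file contents; in practice these lines would live in a file on disk.
    static final String EXAMPLE = String.join("\n",
            "source.directory=c:/scans",
            "poll.interval.seconds=60",
            "rule.nab.match.1=(?<bank>National Australia Bank)",
            "rule.nab.match.2=Statement Date: (?<date>[0-9]{2}/[0-9]{2}/[0-9]{4})",
            "rule.nab.match.3=Account number: (?<acc>[0-9A-Za-z]{5,12})",
            "rule.nab.target=c:/banking/{bank}/{acc}_{date}.pdf");

    /** Parses properties-format text into a Properties object. */
    static Properties load(String contents) {
        Properties props = new Properties();
        try {
            props.load(new StringReader(contents));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return props;
    }

    /** Collects rule.<name>.match.1, .2, ... until the first missing index. */
    static List<Pattern> patternsFor(Properties props, String ruleName) {
        List<Pattern> patterns = new ArrayList<>();
        for (int i = 1; ; i++) {
            String regex = props.getProperty("rule." + ruleName + ".match." + i);
            if (regex == null) {
                break;
            }
            patterns.add(Pattern.compile(regex));
        }
        return patterns;
    }

    public static void main(String[] args) {
        Properties props = load(EXAMPLE);
        System.out.println("Source: " + props.getProperty("source.directory"));
        System.out.println("Patterns in rule 'nab': " + patternsFor(props, "nab").size());
    }
}
```

A flat key/value format like this stays easy to edit by hand, and a GUI added later could read and write the same file.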

The program should run as a service and check at configured intervals for any files in the SOURCE_DIRECTORY.
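The interval check could be sketched with a ScheduledExecutorService, as below. This only covers the polling loop: the class and method names are hypothetical, the handler stands in for the OCR, rule matching, and move steps, and installing the program as an operating system service is left out.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;
import java.util.function.Consumer;
import java.util.stream.Collectors;
import java.util.stream.Stream;

class DirectoryPollerSketch {
    /** Lists regular files in the directory whose names end in .pdf (any case). */
    static List<Path> listPdfs(Path sourceDirectory) {
        try (Stream<Path> entries = Files.list(sourceDirectory)) {
            return entries
                    .filter(Files::isRegularFile)
                    .filter(p -> p.getFileName().toString().toLowerCase().endsWith(".pdf"))
                    .sorted()
                    .collect(Collectors.toList());
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /**
     * Scans the directory immediately, then again intervalSeconds after each
     * scan finishes, passing every PDF found to the handler.
     */
    static ScheduledExecutorService start(Path sourceDirectory, long intervalSeconds,
                                          Consumer<Path> handler) {
        ScheduledExecutorService scheduler = Executors.newSingleThreadScheduledExecutor();
        scheduler.scheduleWithFixedDelay(() -> {
            try {
                listPdfs(sourceDirectory).forEach(handler);
            } catch (RuntimeException e) {
                // An uncaught exception would cancel all future runs, so report and continue.
                System.err.println("Scan failed: " + e);
            }
        }, 0, intervalSeconds, TimeUnit.SECONDS);
        return scheduler;
    }
}
```

A fixed delay, rather than a fixed rate, keeps scans from piling up if OCR on a large batch takes longer than the interval. The handler is expected to move each file out of SOURCE_DIRECTORY once processed; otherwise the same file is picked up again on the next scan.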
