Closed

Write a PDF Organising Application

This project received 15 bids from talented freelancers with an average bid price of $174 AUD.

Project Budget
$30 - $250 AUD
Total Bids
15
Project Description

I want a PDF sorting program written in Java that will:

1. Read PDF files from a configured source directory. (SOURCE_DIRECTORY)
2. Perform OCR on the PDF to convert it into a PDF with embedded text.
3. Rename & move the PDF to a new location based on configured text matching rules.
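Steps 1 and 3 can be sketched with the standard java.nio.file API. This is only an illustration: the class name PdfMover and its method names are hypothetical, and the OCR step is left out because no OCR tool has been chosen yet. The sketch assumes PDFs sit directly in SOURCE_DIRECTORY (no sub-folders) and that an existing file at the target should not be overwritten.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.Locale;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical helper, not part of any existing library.
final class PdfMover {

    /** Lists regular files ending in ".pdf" (case-insensitive) directly inside sourceDirectory. */
    static List<Path> listPdfs(Path sourceDirectory) throws IOException {
        try (Stream<Path> entries = Files.list(sourceDirectory)) {
            return entries
                    .filter(Files::isRegularFile)
                    .filter(p -> p.getFileName().toString()
                            .toLowerCase(Locale.ROOT).endsWith(".pdf"))
                    .sorted()
                    .collect(Collectors.toList());
        }
    }

    /**
     * Moves (and thereby renames) pdf to target, creating missing parent folders.
     * Throws FileAlreadyExistsException rather than overwriting an existing file.
     */
    static Path moveTo(Path pdf, Path target) throws IOException {
        Path parent = target.toAbsolutePath().getParent();
        if (parent != null) {
            Files.createDirectories(parent);
        }
        return Files.move(pdf, target);
    }
}
```

Refusing to overwrite is a cautious default; two statements with the same account number and date would otherwise silently replace each other.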

An example rule might be:
IF
the PDF contains text that matches '<BANK_NAME>' where BANK_NAME = 'National Australia Bank'.
AND
the PDF contains text that matches 'Statement Date: <STATEMENT_DATE>' where STATEMENT_DATE is a formatted date.
AND
the PDF contains text that matches 'Account number: <ACC_NUMBER>' where ACC_NUMBER is any string between 5 and 12 characters long.
THEN
place PDF into c:/banking/<BANK_NAME>/<ACC_NUMBER>_<STATEMENT_DATE>.pdf.

I would think that implementing the rule matching using regular expressions would work well, but I'm open to other ideas.
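As a rough sketch of the regular-expression approach, the example rule could be hard-coded with java.util.regex as below. Several details are assumptions, not part of the request: the class name StatementRule is hypothetical, the date is assumed to be dd/MM/yyyy, ACC_NUMBER is assumed to contain no whitespace, and slashes in the date are swapped for dashes so the result is a valid file name. Java named groups cannot contain underscores, so the groups are called bank, date and acc.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical rule class; a real version would load its patterns from configuration.
final class StatementRule {

    private static final Pattern BANK =
            Pattern.compile("(?<bank>National Australia Bank)");

    // Assumed date layout: dd/MM/yyyy.
    private static final Pattern DATE =
            Pattern.compile("Statement Date:\\s*(?<date>\\d{2}/\\d{2}/\\d{4})");

    // 5 to 12 non-whitespace characters; the lookahead rejects longer strings.
    private static final Pattern ACC =
            Pattern.compile("Account number:\\s*(?<acc>\\S{5,12})(?!\\S)");

    /** Returns the target path if all three conditions match, otherwise empty. */
    static Optional<String> targetPath(String pdfText) {
        Matcher bank = BANK.matcher(pdfText);
        Matcher date = DATE.matcher(pdfText);
        Matcher acc = ACC.matcher(pdfText);
        if (!bank.find() || !date.find() || !acc.find()) {
            return Optional.empty();
        }
        // '/' is a path separator, so it cannot stay in the file name.
        String safeDate = date.group("date").replace('/', '-');
        return Optional.of("c:/banking/" + bank.group("bank") + "/"
                + acc.group("acc") + "_" + safeDate + ".pdf");
    }
}
```

For text containing "National Australia Bank", "Statement Date: 30/06/2015" and "Account number: 123456789", targetPath returns c:/banking/National Australia Bank/123456789_30-06-2015.pdf.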

The OCR would need to be done by something freely available. Perhaps [url removed, login to view]

A GUI for managing the rules would be nice to have, but initially I'd be happy to edit a configuration file manually.

The program should run as a service and check at configured intervals for any files in the SOURCE_DIRECTORY.
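The interval check could be sketched with a ScheduledExecutorService from the standard library. The class name DirectoryPoller and the scanOnce callback are hypothetical; scanOnce stands in for whatever code lists SOURCE_DIRECTORY and processes each file. Registering the program as an operating-system service is a separate concern and is not shown here.

```java
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

// Hypothetical poller; scanOnce is supplied by the caller.
final class DirectoryPoller {

    /**
     * Runs scanOnce immediately, then again after each configured interval.
     * The delay is measured from the end of one scan to the start of the next,
     * so a slow scan never overlaps with the following one.
     */
    static ScheduledExecutorService start(Runnable scanOnce, long interval, TimeUnit unit) {
        ScheduledExecutorService scheduler = Executors.newSingleThreadScheduledExecutor();
        scheduler.scheduleWithFixedDelay(() -> {
            try {
                scanOnce.run();
            } catch (RuntimeException e) {
                // An uncaught exception would cancel all future runs, so log and continue.
                System.err.println("Scan failed: " + e);
            }
        }, 0, interval, unit);
        return scheduler;
    }
}
```

Calling shutdown() on the returned scheduler stops the polling, for example when the service is asked to stop.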
