I am looking to employ a contractor to proof read and correct the OCR text generated from an original PDF scan. Each job will be typically 15 to 20 original pages of text . There is a significant amount of numbers and technical references that need to be checked against the scan and where errors are found in the RTF output, these must be corrected.
This is part of an exciting project capturing historical data from magazines relating to the railways of Britain in the 1960s. These magazines were produced every month over 12 years and I have started to scan the relevant pages into PDF documents and then pass the PDF documents through an OCR software package that generates an RTF document.
It is NOT important for the final document to be re-formatted in its exact original layout. The important part is that the electronic copy is a very good reproduction the original text - word for word (punctuation included).
Particular attention needs to be paid to standard OCR errors such as ensuring that lower case l (L) and capital I (i) are not confused with the digit 1 and the same goes with the letter O and the number 0.
The output will preferably be in MS Word format using Monaco 12 point type face throughout.
I have supplied an example scanned document from July 1968 and the resulting OCR output in RTF for you to be able to estimate accurately the time and effort involved for a specific piece of work.
If you have questions please ask.
Many thanks and good luck