This task involves correcting OCR errors on 107 pages of text, following the detailed guidance given. The original TIFF files used for OCR and the corresponding text files produced by the OCR process will be supplied. The corrected text files must be returned.
The work must be completed by midnight (London time) on Wednesday 12th February 2013.
It is very probable that there will be one more task like this and possible that there will be an additional four after that. Satisfactory completion of this task will give preferred bidder status for any additional tasks.
It is anticipated that successful bids will be about USD 0.75 per page, plus USD 10 overhead for the book (i.e. approximately USD 90 for this task: 107 x 0.75 + 10 = 90.25).
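As a rough check on the figures, the anticipated bid can be worked out from the per-page rate and the per-book overhead quoted above. The sketch below is illustrative only: the function name estimate_bid and its parameter names are hypothetical, and the rates are simply the ones stated in this posting.

```python
def estimate_bid(pages: int, per_page: float = 0.75, overhead: float = 10.0) -> float:
    """Return the anticipated bid in USD: per-page rate times pages, plus book overhead."""
    return pages * per_page + overhead


if __name__ == "__main__":
    # 107 pages is the page count stated for this task.
    print(f"Anticipated bid: USD {estimate_bid(107):.2f}")
```

The same function could be reused for any follow-on book by passing its page count.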
Pay close attention to the sample pages, in particular to the elements that should not be transcribed and to the elements that have to be typed by hand because the OCR engine has omitted text within shaded boxes.
The required accuracy is a maximum of one transcription error per page.
Included in the archive are six sample pages which have been transcribed.
Original scans: page016.tiff, page046.tiff, page054.tiff, page081.tiff, page092.tiff, page102.tiff.
Raw OCR: raw016.txt, raw046.txt, raw054.txt, raw081.txt, raw092.txt, raw102.txt.
Corrected OCR: page016.txt, page046.txt, page054.txt, page081.txt, page092.txt, page102.txt.
There are also five sample pages which have not been transcribed (test1.tiff, rawtest1.txt, test2.tiff, rawtest2.txt, test3.tiff, rawtest3.txt, test4.tiff, rawtest4.txt, test5.tiff, rawtest5.txt).
These sample and test pages are typical of the pages that will be included in the bundle. Some pages will be easier to process, some will be more complex. The corrections should be saved as test1.txt, test2.txt, test3.txt, test4.txt, test5.txt.
Only bids which include the five test pages, processed as described above, will be considered.
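Before submitting, it may help to confirm that all five corrected test files are present under the names given above. The following is a minimal sketch, not part of the official guidance: the function name missing_test_files and the assumption that the files sit together in one folder are hypothetical.

```python
from pathlib import Path

# File names required by the posting: test1.txt through test5.txt.
EXPECTED = [f"test{i}.txt" for i in range(1, 6)]


def missing_test_files(directory: str) -> list[str]:
    """Return the required test file names that are not present in the folder."""
    folder = Path(directory)
    return [name for name in EXPECTED if not (folder / name).is_file()]


if __name__ == "__main__":
    # Assumption: the corrected files are in the current working directory.
    missing = missing_test_files(".")
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("All five test files are present.")
```

This only checks that the files exist; it does not check the transcription itself against the accuracy requirement.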