OCR means Optical Character Recognition. It is used for digitizing scanned images and then recognizing the letters therein for further processing. If your business needs help with OCR you can use the services of our freelancers. Post a project today in order to get bids from freelancers. Hire OCR Developers

Filter

My recent searches
Filter by:
Budget
to
to
to
Skills
Languages
    Job State
    36 jobs found, pricing in USD

    Change PDF documents to editable (so i can put it in google translate)

    $18 (Avg Bid)
    $18 Avg Bid
    39 bids

    we have two type of documents: - multipage PDF files (could already contain also OCR detected text) - multipage Tiff files These pages contain the standarized patchcode T separator pages. Samples of the patchcode T - [url removed, login to view] on page 11 - [url removed, login to view] on page 75 Your job is to provide us a shell script which - gets as input either a PDF file or a Tiff file (choosable by param) - parses through the file and splits the file the by given patchcode T into multiple files (with same filetype) - does OCR of the content (shall be switchable with on/off to decide if OCR shall be done or not) Ensure the pagecode page can have any arbitrary content between the code lines (like in the samples)

    $88 (Avg Bid)
    $88 Avg Bid
    6 bids

    I have multiple physical copies of documents with the same structure.I want to scan all those files with ocr and converted that data into one excel file.

    $115 (Avg Bid)
    $115 Avg Bid
    22 bids

    I need a very simple JOB, which envolves reading a JPEG file (From a specific URL or byte arrays sent via POST) bringing back to me all the categorized data from a specific Brazilian document (Model attached here). I can't receive a string with the complete text, we need to categorize data, for example: Plate, Name, State, etc. This code can run as a API, or Windows Forms. I prefer to use C#. You can use the Vision APIS from Google Cloud or Azure as well.

    $216 (Avg Bid)
    $216 Avg Bid
    16 bids
    Train tesseract version 4 1 day left
    VERIFIED

    Traint Tesseract version 4 to identify a font. And supply the files and syntax to use the trained data for OCR. Tesseract is capable of recognize 99% of the strings without any training, after rescaling and Grayscale with ImageMagick. But it needs to be better. Perferably without ImageMagick Please confirm that You have understood that it is Tessercat version 4 ! I have attached short example.

    $214 (Avg Bid)
    $214 Avg Bid
    8 bids

    Help needed for my decentralized web project

    $179 (Avg Bid)
    $179 Avg Bid
    6 bids

    i want to resolve some simple captcha image. i can resolve captcha using captcha sniper but i want to use c# code to resolve without captcha sniper assist. to do this i found [url removed, login to view] can resolve this captcha image but not sure. if you know or have some c# library to resolve this captcha pls bid me. i upload captcha image i think if more than 50% success rate it enough for me.

    $130 (Avg Bid)
    $130 Avg Bid
    17 bids

    Hello! I am looking for an expert in Optical Character Recognition and a programmer to make some improvements to a currency conversion mobile application for iOS. The application converts prices from one currency into another using OCR from the user's camera. A number of updates are needed, including: - Improved OCR capabilities for small characters. - Recognition of characters of various colours on various backgrounds - Offline functionality - A manual currency input option I can provide more information privately after you have made a bid. Along with your reply, please start your message with the answer to 2+2 so that I know you have read the whole of this post. Thanks!

    $206 (Avg Bid)
    $206 Avg Bid
    7 bids

    i want to resolve some simple captcha image. i can resolve captcha using captcha sniper but i want to use c# code to resolve without captcha sniper assist. to do this i found [url removed, login to view] can resolve this captcha image but not sure. if you know or have some c# library to resolve this captcha pls bid me. i upload captcha image i think if more than 50% success rate it enough for me.

    $23 (Avg Bid)
    $23 Avg Bid
    6 bids
    simple captcha solve Ended
    VERIFIED

    i want to resolve some simple captcha image. i can resolve captcha using captcha sniper but i want to use c# code to resolve without captcha sniper assist. to do this i found [url removed, login to view] can resolve this captcha image but not sure. if you know or have some c# library to resolve this captcha pls bid me. i upload captcha image i think if more than 50% success rate it enough for me.

    $25 (Avg Bid)
    $25 Avg Bid
    3 bids

    The attached word document is ESSENTIAL to understanding this project as it contains very important images. I will ask if you have read the attached brief before I will accept your bid. This is a short description of the project. Please read the attached document for the whole story. We need a SOLR search engine built from old, multi-page PDFs. All of the indexed documents will be PDFs and many will need to go through OCR first. We will probably use something like Foxit to do the image to text conversion. We know the output will be messy, but text will only be used in indexing process. When user does a search, s/he will access the PDF directly. Note: All of our work is in Java. This will be running on a large Linux server. This project is not that simple though. Let’s take a look at this example > [url removed, login to view] We will want to index this 30 page document. But it contains more than one form (unique section). State Oil & Gas sites will often put an entire wellbore’s files in a single PDF. 20 years of paperwork can be sitting in a single PDF. If we index as-is and return results with a 30 to 100-page PDF attached, the user will never be able to find the single mention of their search string after opening the very long PDF file. For this reason, we need to break the 30+ page PDF into individual pages, OCR each, and index each page separately. When doing a search, user is actually searching individual pages. We tell the user we found the queried text on page 19 of the PDF. S/he clicks to get the full 30 pages, but knows to go to page 19. We may even load the PDF in a frame and keep a header at the top that reminds user to look on page 19. And there may be multiple mentions of the search query in a single PDF file. A lot of it will be nasty looking. Documentation goes back 50+ years to typewriters. If this all seems pretty impossible, you would be right. In fact, we believe the OCR will be so incomplete in places, we cannot even show a snippet (10-20 words) of text on the search results page, because it will be nonsensical. But this is ok. If we can OCR 70% of the data from these PDFs, that’s 70% we didn’t have yesterday. And no one will ever see the OCR text to complain how incomplete it is… Why are we going to all this effort? We plan on using SOLR to build a metadata engine around these documents. We are less interested in the content of each page and more interested in the page type, that a particular wellbore even has a C-144 form. We'd like to get as much data as we can but realize we won't be able to get it all. The end user will probably do very little “free text” searching of SOLR. Instead, we will process 10,000 of our own search phrases (tokenization and algorithms), e.g. “Tank Closure” or “C-144” and build a table of all the document types that are inside PDFs for each wellbore. We may tell a user that wellbore [Removed by [url removed, login to view] Admin - please see Section 13 of our Terms and Conditions] Now, it starts to make sense why we are breaking apart all the PDFs for OCR and indexing. We may store page 1, 2, 3, 4 and 5 in a database row for wellbore [Removed by [url removed, login to view] Admin - please see Section 13 of our Terms and Conditions] We cannot stress this enough. The user never sees the OCR text or the broken apart PDFs. Will be way too confusing. Instead, we will direct the user to open the original PDFs and go to page 6 or page 1 or page 27 and read further about a tank disclosure for this particular wellbore. Expect 10-15 million PDFs. If this work is good, we have many more follow on projects from this that we will LOVE for you to work on. OK! That should be enough to communicate the main purpose of this project. Please read the attached document which has more detailed information about the entire project.

    $2285 (Avg Bid)
    $2285 Avg Bid
    9 bids

    Take a PDF file and prepare it to a Word file. The file consists of Norwegian text plus numbers, but knowledge of Norwegian is not necessary)

    $18 / hr (Avg Bid)
    $18 / hr Avg Bid
    61 bids

    You must have a prior experience with PyPDF2 or Image Processing. In you bid please let me know which part of the project interests you. See the details here: [url removed, login to view]

    $8 / hr (Avg Bid)
    $8 / hr Avg Bid
    5 bids

    I am looking for someone who can correct the text within PDF files in which the text has been extracted using OCR. However, as expected there are character recognition errors in the generated documents, and would need to be corrected as per the original PDF files that were used as a basis for the character recognition. There are about 40 documents (max 2 Pages - Cover and Content) that need to be corrected and you will need to have Acrobat Editor in order to correct the OCR generated files. The diverence in the OCR and the bitmap based PDF's is about 10%.

    $117 (Avg Bid)
    $117 Avg Bid
    34 bids

    We have a staff card and want to extract staff ID. We want to modify the open source OCR project to recognise staff ID from our staff card.

    $2426 (Avg Bid)
    $2426 Avg Bid
    27 bids

    I need to have done android activity which uses camera and OCR engine to get a data from passport's machine readable zone. Just one single activity. camera input and textual data on output.

    $193 (Avg Bid)
    $193 Avg Bid
    30 bids
    OCR App for android Ended
    VERIFIED

    Android developer needed with Google Tesseract/Open CV OCR library experience. All candidates Must successfully OCR an image as part of hiring process. No exception. Whoever is not willing to participate in the demo, are encouraged NOT to apply. Thanks

    $560 (Avg Bid)
    $560 Avg Bid
    44 bids

    I need to Recognize Digits Written in Marathi Language & Input same into the Database

    $434 (Avg Bid)
    $434 Avg Bid
    19 bids
    OCR image process Ended
    VERIFIED

    We are wanting to capture information from windows applications and web pages in a dynamic, configurable way.   This should include being able to OCRing portions of the window, matching average pixel colors, single pixel checks, comparing a region with a sample or template image. All of these techniques and more should be configured and tested via a user interface.  The user interface should allow an image to be loaded into it (or window selected, which will have an image take of it) and regions marked out, along with any of these checks and rules on what data it generates. For example, a rectangle would be drawn and positioned on the sample image, then an average pixel color method would be selected and a color selected, a key/name is then given for this, along with a rule that it is a boolean (another example of data type would be string) and it's TRUE if the average color matches.

    $3373 (Avg Bid)
    $3373 Avg Bid
    11 bids

    Who can Help me? I need scrap this website [url removed, login to view] , to do this its necessary use rotative brazilian proxy and solve the Google Recaptcha. To access the website its necessary brazilian proxy.

    $535 (Avg Bid)
    $535 Avg Bid
    11 bids

    Top OCR Community Articles