PDF, DOC, PPT, XLS, eBook --> TEXT I need source code for file handler ASAP-- or CLEAN language translation source

This project received 5 bids from talented freelancers with an average bid price of $ USD.

Get free quotes for a project like this
Project Budget
$30 - $250 USD
Total Bids
Project Description


We need this yesterday!

Most Important: We are trying to extract text from various file types such as DOC, PDF, PPT (notes pages), XLS, and others for use in our website.

Secondary Importance: We need to convert English text or rtf to other languages, Spanish, French, and others.

Can you do the following:

Provide us with source code for independently operating functions that will extract the data (text) from PDF and DOC files as well as PPT and XLS files... we should focus on the 1997-2003 versions of these formats because while the newest Office is reverse-compatible, the formats may not be.

Here is our scenario:

User uploads file (DOC, PDF, PPT, XLS…possibly ebook) -> our file handler recognizes file -> the proper function is called to extract the text -> the text is sent to text box for editing.

Eventually we would like to incorporate OCR for protected sources or images such as PDF, XPS, TIF… but for now, just standard PDF, DOC, PPT, and XLS are necessary.

If you have the source code ready to go, you would have a chance to make some extra money out of it.

We ARE NOT looking to resell the code, we need it as a component in our comprehensive website.

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online