Closed

Convert Microsoft .doc and/or OpenOffice .odt text files to plain text

This project was awarded to igors233 for $240 USD.

Get free quotes for a project like this
Employer working
Awarded to:
Skills Required
Project Budget
$30 - $250 USD
Total Bids
8
Project Description

Implement in Delphi 2010 (or compatible) a non-visual class called TTextFileConverter that can convert one or all of the following file formats to plain text:
* Microsoft Office .doc
* OpenOffice .odt
* .RTF file
* HTML files

By converting a file 'to plain text' I mean that the output of the process is a Delphi String that contains the text content without any formatting (except whitespace).

For example, if I would open up Microsoft Word, write a text document with a text 'Foobar 123' with the '123' in red font and 'Foobar' as a headline, save it as 'c:\[url removed, login to view]', calling TTextFileConverter's Function ReadFile('c:\[url removed, login to view]') would return a string 'Foobar 123'. You can modify the parameter list of the ReadFile function in order to return an error code in case the was an error in the analysis of the file.

Note: With your bid, please state whether your solution supports all the four file formats (.doc, .odt, .rtf and .html) or just some. If you can only provide a solution that supports e.g. .doc and .html, you may still place a bid but you must state this with your bid. I may not find a coder who can do all these four formats, so in that case I will consider other bids. Do not offer any solution that relies in other programming languages or the use of any DLL files or any proprietary code or libraries.

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online