Cancelled

Pandoc RTF Reader

A Pandoc Reader for RTF in Haskell

Pandoc is an Open Source program written in the language Haskell that converts in between several markup file formats, among them HTML, LaTeX and DocBook.

The program does its job by using a Reader and a Writer for each of the formats. All source code is available from:

[url removed, login to view]

I would like to create a Reader for the RTF format, able to read files all the way to ver. 1.9.1 of the RTF specs - to be added t the Pandoc tool.

The Reader should use Haskell alone, abide by the structure of Pandoc and be able to work with Pandoc to translate files in other formats. I'll provide RTF test files.

Resources:

RTF ver. 1.6 Specs:

[url removed, login to view]

RTF ver. 1.5 Specs:

[url removed, login to view]

Wikipedia:

[url removed, login to view]

unRTF:

[url removed, login to view]

rtf2latex2e:

[url removed, login to view]

Attached is an RTF file with the examples of what couls/should be converted, to be used in tests.

w2LaTeX: http://www.ctan.org/tex-archive/support/w2latex

The reader for RTF should at least handle:

* convert text style: bold, italic, color, big, small,...
* underlines and strikethroughs
* expanded and condensed text
* support for text shadowing, outlining, embossing, or engraving
* changes in the foreground and background colors
* reads embedded figures: PICT, EMF, GIF, TIFF, WMF, PNG, JPEG
* reads tables
* superscripts and subscripts
* equations: converts embedded MathType equations
* symbols: greek and math symbols
* reads footnotes (on pages and in tables)
* hyperlinks
* internal document page references

and be able to process the attached RTF file in here.

Pandoc has readers written for 7 languages: markdown, reStructuredText, textile, HTML, DocBook, LaTeX and MediaWiki markup, one should look closely at these readers to have an idea how to proceed with a new one.

Skills: Microsoft, Software Architecture, UNIX

See more: pandoc rtf, latex job, html 5 code reader, haskell job, pandoc latex, RTF, linux writer, file reader, architecture office, rtf library, latex html, pandoc rtf reader, latex file, latex format, rtf2latex2e, reader software, microsoft office test, markup tool, haskell rtf, net aspx linux, latex latex latex, code code reader, html reader, open aspx html, wiki software net

About the Employer:
( 63 reviews ) Oakland, United States

Project ID: #4410408