Provide the source code of a C# application or assembly that extracts all text from any IDML (InDesign) file exactly, in a form that can be passed to another text analyzer. The text must therefore be extracted in the order in which a human would read it. In addition, every exported text item must be marked in the XML files inside the IDML file so that it can be matched to its corresponding XML element, which makes the reverse operation (e.g. replacing text) possible. Three sample IDML files are provided, together with their PDF exports. The final application will be tested against these files.
C# .NET (preferably 2.0, but not required)
SVN (server & credentials will be provided)
Only an object-oriented (OOP) approach is allowed.
Any InDesign (.idml) file.
An IDML file is a zip archive containing many XML files. Working out what each file does is part of the task; I do not provide details on the IDML format.
The layout saved in an IDML file is split across a number of XML files, some of which reference or include others. A single text (e.g. a paragraph) may be split across many XML elements. The files contain many different kinds of items, all expressed as XML, e.g. tables and images.
You may not use any third-party component unless it has been discussed and approved before work starts. This also applies to the licences of such components.
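To illustrate the marking requirement, here is a minimal sketch, not a reference implementation. It assumes a single story XML file has already been unpacked from the archive and loaded, and that text runs sit in elements named Content, as they do in typical IDML story files. The class names and the ExtractId marker attribute are hypothetical and are not part of the IDML format. The sketch does not handle paragraph breaks, tables, or the order of stories on the page, and it has not been verified that InDesign accepts the extra attribute, so markers may need to be stripped before repackaging.

```csharp
using System;
using System.Collections.Generic;
using System.Xml;

// One extracted text run, keyed by the marker written into the XML.
public class TextItem
{
    private readonly string id;
    private readonly string text;

    public TextItem(string id, string text)
    {
        this.id = id;
        this.text = text;
    }

    public string Id { get { return id; } }
    public string Text { get { return text; } }
}

public class StoryTextExtractor
{
    // Hypothetical marker attribute; chosen for this sketch only.
    public const string MarkerAttribute = "ExtractId";

    private int counter;

    // Walks the story in document order, marks every Content element
    // and returns the text items in that same order.
    public List<TextItem> ExtractAndMark(XmlDocument story)
    {
        List<TextItem> items = new List<TextItem>();
        Visit(story.DocumentElement, items);
        return items;
    }

    private void Visit(XmlNode node, List<TextItem> items)
    {
        XmlElement element = node as XmlElement;
        if (element == null)
        {
            return; // skip text nodes, comments, processing instructions
        }

        if (element.LocalName == "Content")
        {
            counter++;
            string id = "t" + counter;
            element.SetAttribute(MarkerAttribute, id);
            items.Add(new TextItem(id, element.InnerText));
            return;
        }

        foreach (XmlNode child in element.ChildNodes)
        {
            Visit(child, items);
        }
    }
}
```

A caller would load each story into an XmlDocument, call ExtractAndMark, pass the returned items to the analyzer, and save the marked document so that a later replace step can find each element by its marker. Setting PreserveWhitespace to true on the XmlDocument before loading is advisable, since otherwise text runs consisting only of whitespace are dropped, which conflicts with the requirement for an exact extract.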
See details in attached files.