You have chosen to sponsor your bid up to a maximum amount of .
Want some delphi code that will return a stringlist of words (not symbols) from a pdf, word doc, excel spreadsheet or powerpoint file.
The function needs to receive a stringlist of words to exclude (e.g ignore) from the returned stringlist
Need something that is lightning fast
Needs to support delphi 5 and onwards
Additional Project Description:
11/15/2013 at 17:45 PET
An rough example
function return_plain_text_words_fromfile(excludethese: Tstrings; extractfromfilename: string):Tstringlist;
// call by mystringlist:=return_plain_text_words_fromfile(excludelist, 'c:\my documents\my word.docx');
// open file
files will be of this type
word *.doc, docx
spreadsheet *.xls etc
// extract words excluding symbols
// add words to results stringlist
// exclude those in excludethese list
// return result
11/15/2013 at 17:56 PET
example of exclude list >> words like 'the','and','a','to','etc'
I just want plain text words retruned from the file without any formatting characters, symbols, rtf code and the like