Scan the words-software

Create a software that scans an e-book file and creates a database file which contains hard words contained in that e-book. So the software you make should recognize the hard words from any text file [like .pdf, .doc, .docx, .djvu, .epub, .lit, .pdb, .azw]. It should use two databases to do this job one containing a list of "hard words" and other "all words" in the english language.

So the software finds the words in the e-book file which are hard; where the hard words are those which are present in your "hard words" database and creates an output database file (likely to be .sql or .csv) of list of hard words.

Secondly the software should also be able to identify words which are not hard and also not present in the "all words" database of your software which we define to be new to the software. So the software should also create a second output file which are words which are new to the software as they are not present in the "all words" database.

Important features of the software:

Create your own database of hard words with headers 'word' and 'meaning.' You may create such database file from word-lists available over the internet like GRE top 8000 words or similar.

It should be able to create the two output database files in different sorting methods like alphabetical and the order in which they appear in the e-book file.

Output database will also contain two headers "name" and "meaning"

Databases used should be large enough to recognize maximum words though it should avoid redundancy.

Meanings can be synonyms but should be reliable and easy

Softwares should have the feature of adding new words to the database or replacing it with a new database file.

Software need not to be good at interface. More important is its functioning according to the parameters described.

Skills: C Programming, Java, Software Architecture, Software Development

See more: to the maximum, database one word or two, architecture define, e book software, scan to text, pdf to Epub, gre, according to the, alphabetical word, magento paypal advanced appear order, word alphabetical order, sql file csv, pdb csv, Pdf scan, software dual audio output, replacing words synonyms, 8000 important words, create pdf epub, english synonyms database, 8000 english words

About the Employer:
( 0 reviews ) India

Project ID: #7140082

6 freelancers are bidding on average ₹17304 for this job


A proposal has not yet been provided

₹72000 INR in 10 days
(14 Reviews)

A proposal has not yet been provided

₹11666 INR in 1 day
(3 Reviews)

A proposal has not yet been provided

₹1300 INR in 1 day
(6 Reviews)

Hello there, my name is George Chondrompilas and I am a Software Engineer with more than 3 years of experience on many fields. I think I am the right person for this project because I have the required experience to fi More

₹5555 INR in 3 days
(1 Review)

A proposal has not yet been provided

₹1300 INR in 1 day
(0 Reviews)

I have prior experience on projects of a similar nature, much of the existing code can be reused to speed up development time. Feel free to ask me any questions before you decide Proposed Milestones: Create a More

₹12000 INR in 7 days
(0 Reviews)