This is going to be a fairly detailed description, so please bear with me.

I want an application that reads words from an input file, searches a PDF file for those words, and counts the number of occurrences of each.

I recently found out that Adobe provides the Acrobat SDK for controlling Acrobat Reader, for plug-in development, and for a few other things. The documentation and the SDK itself can be downloaded from Adobe. What I am interested in is this: Acrobat SDK 8.1 includes a sample Visual Basic project for Visual Studio 2005 which, according to Adobe, uses Acrobat to search for words in a PDF file through something called the Acrobat search plug-in. Everything it needs should already be on your system if you have Acrobat 8 installed.

I am attaching that Visual Basic project to this bid request. Please compile, build, and run it in Visual Studio and analyze it a bit. I do not have Visual Studio, so I cannot compile it myself, but I am assuming the app takes its input from a Windows form and searches the PDF for those words.

What I want is for someone to modify this VB app so that it takes its input from a file, counts how many times each word in the input file occurs in the PDF, and writes these counts to another file.

The format of the input and output text files is simple, and I will give it to interested bidders. Let me know if I have missed anything in this bid request.

Do not bid if your rating is less than 8.

Thanks,
Addy
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.
b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.
3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).
I am an experienced Java programmer myself. I have tried searching for text in PDFs by extracting the text with libraries and then applying regular expressions, and I have also tried Lucene. Whatever I do, it does not work perfectly, and I need perfect results that are 100% consistent with Acrobat Reader 8. So please do not suggest using anything other than the Acrobat SDK.
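
To make the file-handling part of the request concrete, here is a minimal Java sketch of the flow described above: read the words, get a count for each, write the counts out. It is an illustration only. The real input and output formats are not given in this request, so "one word per line" for the input and "word, tab, count" for the output are assumptions, and the class and method names are hypothetical. The counting is passed in as a function that stands in for the Acrobat SDK search; the sketch does not call the SDK, and Java is used only for illustration, since the deliverable is the modified VB app.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.ToIntFunction;

// Hypothetical helper: file handling only, no Acrobat SDK calls.
class WordCountReport {

    // Assumed input format: one search word per line; blank lines are skipped.
    static List<String> readWords(Path input) throws IOException {
        List<String> words = new ArrayList<>();
        for (String line : Files.readAllLines(input, StandardCharsets.UTF_8)) {
            String word = line.trim();
            if (!word.isEmpty()) {
                words.add(word);
            }
        }
        return words;
    }

    // 'counter' is a placeholder for the Acrobat-based search: given a word,
    // it returns the number of occurrences in the PDF.
    static Map<String, Integer> countAll(List<String> words,
                                         ToIntFunction<String> counter) {
        // LinkedHashMap keeps the words in input-file order.
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (String word : words) {
            counts.put(word, counter.applyAsInt(word));
        }
        return counts;
    }

    // Assumed output format: one "word<TAB>count" line per word.
    static void writeReport(Map<String, Integer> counts, Path output)
            throws IOException {
        List<String> lines = new ArrayList<>();
        for (Map.Entry<String, Integer> entry : counts.entrySet()) {
            lines.add(entry.getKey() + "\t" + entry.getValue());
        }
        Files.write(output, lines, StandardCharsets.UTF_8);
    }
}
```

Keeping the search behind a single function means the file reading and report writing can be checked with a dummy counter before the Acrobat search is wired in.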