This project is actually a presentation that needs to be prepared for a seminar. The following has to be programmed in order to teach a class of 30 students. Due to time constraints, I cannot do the programming myself. For anybody with even basic Java programming experience, this should be an easy one.
IMPORTANT: THIS NEEDS TO BE FINISHED IN ONE WEEK.
There are three tasks altogether:
Using Google AppEngine (in Java), write a simple text editing application. You may want to investigate TinyMCE as a potential editing solution.
Your application must allow for the following functionality:
2) View previously saved documents
3) Edit a saved document
4) Create a new document for editing
5) Save and Save As functionality
All documents must be saved in the cloud. Your application must be deployed and running. When you submit your application, ensure that you supply the URL where your deployed application can be tested.
You must supply your commented Java code along with a document recording this URL and detailing the design and functionality of your application.
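As a rough illustration of the document operations listed above (view, edit, create, Save, Save As), here is a minimal sketch in plain Java. The class name `DocumentStore` and all of its methods are hypothetical, and the in-memory map only stands in for cloud storage; a deployed App Engine application would need to persist documents in a cloud datastore instead, and TinyMCE would supply the editing UI in the browser.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: an in-memory stand-in for the cloud document store.
class DocumentStore {
    // title -> document content (e.g. the HTML produced by the editor)
    private final Map<String, String> documents = new LinkedHashMap<>();

    /** Create a new, empty document. Returns false if the title is already taken. */
    boolean create(String title) {
        return documents.putIfAbsent(title, "") == null;
    }

    /** Save: store the content under the given title, overwriting any previous version. */
    void save(String title, String content) {
        documents.put(title, content);
    }

    /** Save As: store the content under a new title. Returns false if that title exists. */
    boolean saveAs(String newTitle, String content) {
        return documents.putIfAbsent(newTitle, content) == null;
    }

    /** Titles of all saved documents, in creation order, for the "view documents" page. */
    List<String> listTitles() {
        return new ArrayList<>(documents.keySet());
    }

    /** Load a saved document for editing. Returns null if no such document exists. */
    String load(String title) {
        return documents.get(title);
    }
}
```

In this sketch Save overwrites in place while Save As refuses to clobber an existing title; whether Save As should be allowed to overwrite is a design choice the brief leaves open.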
For this assignment you are required to implement the bpref Information Retrieval evaluation metric. You must create a program (using a programming language of your choice) that can do the following:
- Accept two command-line arguments. The first is the name of a file containing relevance judgments for a set of queries. The second is the name of a file containing a set of results that need to be evaluated. The format of these files is given below. Samples of how these files are structured are posted on Moodle for the MSCC-IRWS course. As an example, your program will be run like this (depending on the programming language used):
    perl myprogram.pl relevant.txt results.txt
    python myprogram.py relevant.txt results.txt
    myprogram.exe relevant.txt results.txt
    java MyProgram relevant.txt results.txt
- The file containing the relevance judgments has three columns. The first column contains a query ID. The second column contains a document ID. The third column contains a '1' if the document is relevant to the query, or a '0' if it is non-relevant. There may be any number of queries, and the query IDs will be in order. For example:
1 GH43-EKD 1
1 GH43-KEC 1
1 LEIR-23O 0
2 DC7E-C8E 0
2 CJEU-390 1
  - Documents GH43-EKD and GH43-KEC are relevant for Query 1
  - Document LEIR-23O is not relevant for Query 1
  - Document DC7E-C8E is not relevant for Query 2
  - Document CJEU-390 is relevant for Query 2
- The file containing the results has two columns. Again, the first is the query ID. The second contains the ID of a document returned by an IR system. Within each query, documents are listed in rank order, with the highest-scoring document first. There may be any number of queries in the file, and the query IDs will be in order.
  - The result set for Query 1 is: G05-28-1307395, G03-56-3077214, G01-98-3458857, G13-53-1377911, G10-47-2751220 and G28-84-3654906, in that order (with document G05-28-1307395 having the highest score)
  - The result set for Query 2 is: G00-18-0557451, G00-06-2996037, G07-93-2434760, G01-98-3614741, G02-58-1998234, G03-09-3255983 and G03-15-3550640, in that order
- When given these two files, your program should output the result of running the bpref evaluation metric. This should be averaged over all the queries in the files.
- It is ESSENTIAL that this can be run as a standalone program, without requiring IDEs such as Visual Studio, Eclipse, NetBeans or similar.
- It is also ESSENTIAL that you write all of the submitted code yourself. It is not permitted to use downloaded code in this assignment.
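To make the requirements above concrete, here is a minimal Java sketch of such a program, not a reference solution. The brief does not give the bpref formula, so the sketch assumes the following variant: for each query, bpref = (1/R) * sum over retrieved relevant documents r of (1 - |n ranked above r| / min(R, N)), where R is the number of judged relevant documents, N is the number of judged non-relevant documents, and n ranges over the first R judged non-relevant documents retrieved. The original definition divides by R rather than min(R, N), so check this against the course notes. Also assumed: unjudged documents are ignored, columns are whitespace-separated, and queries with no relevant judgments are skipped when averaging. The class name `Bpref` and its method names are hypothetical.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

class Bpref {

    /** Parses "queryId docId judgment" lines into queryId -> (docId -> isRelevant). */
    static Map<String, Map<String, Boolean>> parseJudgments(List<String> lines) {
        Map<String, Map<String, Boolean>> judgments = new LinkedHashMap<>();
        for (String line : lines) {
            String[] cols = line.trim().split("\\s+");
            if (cols.length < 3) continue; // skip blank or malformed lines
            judgments.computeIfAbsent(cols[0], q -> new LinkedHashMap<>())
                     .put(cols[1], cols[2].equals("1"));
        }
        return judgments;
    }

    /** Parses "queryId docId" lines into queryId -> ranked document IDs, best first. */
    static Map<String, List<String>> parseResults(List<String> lines) {
        Map<String, List<String>> results = new LinkedHashMap<>();
        for (String line : lines) {
            String[] cols = line.trim().split("\\s+");
            if (cols.length < 2) continue; // skip blank or malformed lines
            results.computeIfAbsent(cols[0], q -> new ArrayList<>()).add(cols[1]);
        }
        return results;
    }

    /** bpref for one query, using the min(R, N) denominator assumed in the lead-in. */
    static double bprefForQuery(Map<String, Boolean> judged, List<String> ranking) {
        int totalRel = 0;
        int totalNonRel = 0;
        for (boolean rel : judged.values()) {
            if (rel) totalRel++; else totalNonRel++;
        }
        if (totalRel == 0) return 0.0;
        int bound = Math.min(totalRel, totalNonRel);
        int nonRelSeen = 0; // judged non-relevant documents ranked so far
        double sum = 0.0;
        for (String doc : ranking) {
            Boolean rel = judged.get(doc);
            if (rel == null) continue; // unjudged documents are ignored
            if (!rel) {
                nonRelSeen++;
            } else if (bound == 0) {
                sum += 1.0; // no judged non-relevant documents exist for this query
            } else {
                // Only the first R non-relevant documents count against r.
                int above = Math.min(Math.min(nonRelSeen, totalRel), bound);
                sum += 1.0 - (double) above / bound;
            }
        }
        return sum / totalRel;
    }

    /** Mean bpref over all queries that have at least one relevant judgment. */
    static double meanBpref(Map<String, Map<String, Boolean>> judgments,
                            Map<String, List<String>> results) {
        double sum = 0.0;
        int queries = 0;
        for (Map.Entry<String, Map<String, Boolean>> e : judgments.entrySet()) {
            if (!e.getValue().containsValue(true)) continue;
            sum += bprefForQuery(e.getValue(),
                                 results.getOrDefault(e.getKey(), new ArrayList<>()));
            queries++;
        }
        return queries == 0 ? 0.0 : sum / queries;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 2) {
            System.err.println("Usage: java Bpref relevant.txt results.txt");
            System.exit(1);
        }
        Map<String, Map<String, Boolean>> judgments =
                parseJudgments(Files.readAllLines(Paths.get(args[0])));
        Map<String, List<String>> results =
                parseResults(Files.readAllLines(Paths.get(args[1])));
        System.out.printf("bpref: %.4f%n", meanBpref(judgments, results));
    }
}
```

Saved as Bpref.java, this compiles and runs from the command line without an IDE: `javac Bpref.java`, then `java Bpref relevant.txt results.txt`.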
To be posted soon.
Additional Project Description:
07/12/2012 at 20:59 SGT
Please see the attached file, tasks.doc. It lists the remaining tasks. I said earlier that there were three tasks, but there are now five altogether.
IMPORTANT: You need to know Pure Data (Pd) for one of them. If you don't know Pure Data, please do not apply. I have also attached the lecture notes I prepared for my students.
I am willing to take new bids from everyone who has applied so far. Again, the deadline for this is the 20th of July.