In Progress

Perl Document Retreival

The information of the project is given below :

Task Description

You will be provided with the files [url removed, login to view], [url removed, login to view], [url removed, login to view] and The file [url removed, login to view](can't be uploaded since greater than 1mb) contains a collection of documents which record publications in the CACM (Communications of the Association for Computing Machinery). Inspect the file and you will see that the text of each document comes enclosed within (XML-style) open and close document tags, where the open tag also specifies a numeric identifier for the document. Each document is a short record of a CACM publication, including its title, author(s), and abstract — although one or other of these (especially abstract) may be absent for a given document. You are required to write two separate programs (Perl scripts): (i) one program that computes an inverted index for the document collection, and (ii) a second program which loads this inverted index and uses it to do retrieval.

The assignment is uploaded.

Skills: Perl

See more: n.c. machinery, cgi communications, author-it, abstract tag, absent , numeric, machinery, information retrieval project, program inverted index, inverted index program, retrieval, xml assignment, short programs, enclosed files, text document, documents required project, information collection, write information retrieval, project abstract, short document, record programs, assignment programs, collection documents, record files, write project files

About the Employer:
( 0 reviews ) Sheffield, United Kingdom

Project ID: #214049

Awarded to:


I am experianced perl programmer from past 5 yrs. I can do this in a short period of time

$75 USD in 4 days
(0 Reviews)