Project MySql Workbench

Status: CLOSED
Bids: 5
Avg Bid (EUR): 224
Project Budget (EUR): €100 - €250

Project Description:
We are an educational company and need the project below completed for one of our clients.
Details from our client:

During this course, you will develop an information portal on a topic of your choice based on focused crawling technology. A focused crawler is a specialized crawler which "learns" a set of target topics from user-provided training data and is then able to automatically classify the web pages it finds, based on both their structural and content-related features. The web pages that are classified into your topics of interest should be indexed by Apache's Lucene search engine and be accessible to a user via regular keyword searches. Optionally, you may want to enable the user to browse the crawled pages according to your topics of interest ("topic exploration"), or further cluster the documents according to their contents ("faceted search"). See the Weka library below for more data mining tools.
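
To illustrate the "learning" step described above, here is a minimal sketch of a topic classifier, assuming a simple multinomial Naive Bayes model over page text with add-one smoothing. The class name TopicClassifier and its methods are hypothetical and are not taken from FocusedCrawler.zip; a real solution would more likely use the Weka library and also consider structural features of the pages. The sketch assumes every topic is trained with at least one non-empty document.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

/** Hypothetical sketch: multinomial Naive Bayes over page text (not the project's API). */
class TopicClassifier {
    // topic -> (term -> occurrences in that topic's training documents)
    private final Map<String, Map<String, Integer>> termCounts = new HashMap<>();
    // topic -> total number of terms seen for that topic
    private final Map<String, Integer> totalTerms = new HashMap<>();
    // topic -> number of training documents
    private final Map<String, Integer> docCounts = new HashMap<>();
    // all distinct terms seen during training
    private final Set<String> vocabulary = new HashSet<>();
    private int totalDocs = 0;

    /** Lower-cases the text and splits it on anything that is not a letter or digit. */
    static String[] tokenize(String text) {
        return text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+");
    }

    /** Adds one user-provided training document for the given topic. */
    void train(String topic, String text) {
        Map<String, Integer> counts = termCounts.computeIfAbsent(topic, t -> new HashMap<>());
        for (String term : tokenize(text)) {
            if (term.isEmpty()) continue; // split() can yield a leading empty token
            counts.merge(term, 1, Integer::sum);
            totalTerms.merge(topic, 1, Integer::sum);
            vocabulary.add(term);
        }
        docCounts.merge(topic, 1, Integer::sum);
        totalDocs++;
    }

    /** Returns the topic with the highest log-probability, or null if nothing was trained. */
    String classify(String text) {
        String best = null;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (String topic : termCounts.keySet()) {
            // log prior: share of training documents that belong to this topic
            double score = Math.log(docCounts.get(topic) / (double) totalDocs);
            Map<String, Integer> counts = termCounts.get(topic);
            int total = totalTerms.getOrDefault(topic, 0);
            for (String term : tokenize(text)) {
                if (term.isEmpty()) continue;
                int c = counts.getOrDefault(term, 0);
                // add-one (Laplace) smoothing so unseen terms do not zero out the score
                score += Math.log((c + 1.0) / (total + vocabulary.size()));
            }
            if (score > bestScore) {
                bestScore = score;
                best = topic;
            }
        }
        return best;
    }
}
```

In a full crawler, only pages that classify() assigns to one of the topics of interest would be passed on to the Lucene index.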

Detailed descriptions of the architecture of focused crawlers are available via the above research papers.

A demo of the BINGO! focused crawler is available for download from the following URL:
http://www.mpi-inf.mpg.de/departments/d5/software/bingo/

A suggested topic for building your information portal is the computer science domain. If you choose this domain, you may consider crawling and classifying the homepages of computer-science researchers and their publications (which are usually available as PDF files). DBLP, for example, is a very good source for seed URLs in this domain: http://www.informatik.uni-trier.de/~ley/db/
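
As a rough illustration of how seed URLs could drive the crawl, the following sketch shows a crawl frontier that hands out the highest-scored URL first and skips URLs it has already seen. The class CrawlFrontier, its methods, and the numeric scores are assumptions made for this example and are not part of the provided package structure; in a focused crawler the score would typically come from the topic classifier.

```java
import java.net.URI;
import java.util.Comparator;
import java.util.HashSet;
import java.util.PriorityQueue;
import java.util.Set;

/** Hypothetical sketch: a priority-ordered crawl frontier with duplicate detection. */
class CrawlFrontier {
    private static final class Entry {
        final URI url;
        final double score; // higher means "crawl sooner"
        final long order;   // insertion order, used to break ties

        Entry(URI url, double score, long order) {
            this.url = url;
            this.score = score;
            this.order = order;
        }
    }

    // Highest score first; among equal scores, the URL added first wins.
    private final PriorityQueue<Entry> queue = new PriorityQueue<>(
            Comparator.comparingDouble((Entry e) -> -e.score)
                      .thenComparingLong(e -> e.order));
    private final Set<URI> seen = new HashSet<>();
    private long counter = 0;

    /** Drops the fragment and resolves "." and ".." path segments. */
    static URI normalize(String url) {
        int hash = url.indexOf('#');
        String withoutFragment = hash >= 0 ? url.substring(0, hash) : url;
        return URI.create(withoutFragment).normalize();
    }

    /** Queues a URL unless an equivalent one was queued before; returns true if it was added. */
    boolean add(String url, double score) {
        URI normalized = normalize(url);
        if (!seen.add(normalized)) {
            return false;
        }
        queue.add(new Entry(normalized, score, counter++));
        return true;
    }

    /** Returns the next URL to fetch, or null when the frontier is empty. */
    URI next() {
        Entry e = queue.poll();
        return e == null ? null : e.url;
    }

    boolean isEmpty() {
        return queue.isEmpty();
    }
}
```

A crawl over the computer science domain could start with frontier.add("http://www.informatik.uni-trier.de/~ley/db/", 1.0) and then add each extracted link with a score derived from the page it was found on.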

JAVA PACKAGE STRUCTURE

The file FocusedCrawler.zip provides a predefined Java package structure and several abstract classes which should serve as the basis for your implementation of the project. The preferred way to edit and compile the Java sources is probably to use the Eclipse IDE (http://www.eclipse.org/). You need to add the three Jar files in the ./lib directory to your Java classpath in order to compile the sources.

Please read the full project description in the attached docx file.

Skills required:
Academic Writing, MySQL, Report Writing, Software Architecture, Technical Writing
Additional Files: FocusedCrawler(1)(1).zip, Project MySql Workbench.docx
About the employer:
Verified

€250 in 4 days
€183 in 3 days
€157 in 8 days
€309 in 12 days
€220 in 15 days