Closed

Hadoop project

Phase 2: Implement MR programs to solve unstructured data problems on the HDFS setup. (25 points) Due date: 4/17/2021. In this phase you will implement the word co-occurrence MR algorithm discussed in Lin and Dyer's book. You'll select a data set from publications in any subject area you are familiar with and prepare co-occurrence or co-author information from the publications. The stripes method for co-occurrence may be better suited for this application. Map will have to parse and drop the extra text in the publications. We need only the first author as the key, the rest of the authors as the value, and the number of occurrences in a given corpus.

Input: Many publications from an author.

Output: Author as the key; the value is an associative array of the co-authors, with the number of occurrences as the entry for each co-author.
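
For anyone scoping this, here is a minimal local sketch of the stripes approach applied to the input and output described above. It is not a Hadoop job and not a required design. The record layout (authors separated by semicolons, then a tab, then the remaining publication text) and the names map_publication, reduce_stripes and run_job are assumptions made for illustration only; a real data set will need its own parsing.

```python
from collections import Counter, defaultdict


def map_publication(line):
    """Map step: emit (first_author, stripe) for one publication record.

    Assumed (hypothetical) layout: 'author1; author2; ...<TAB>other text'.
    Everything after the first tab is dropped as extra text.
    """
    author_field = line.split("\t", 1)[0]
    authors = [a.strip() for a in author_field.split(";") if a.strip()]
    if len(authors) < 2:
        return  # single-author record: no co-authors to count
    # The stripe is an associative array: co-author -> count in this record.
    yield authors[0], Counter(authors[1:])


def reduce_stripes(first_author, stripes):
    """Reduce step: element-wise sum of all stripes for one first author."""
    total = Counter()
    for stripe in stripes:
        total.update(stripe)
    return first_author, dict(total)


def run_job(lines):
    """Simulate map, shuffle (group by key) and reduce in one process."""
    grouped = defaultdict(list)
    for line in lines:
        for key, stripe in map_publication(line):
            grouped[key].append(stripe)
    return dict(reduce_stripes(k, v) for k, v in sorted(grouped.items()))


if __name__ == "__main__":
    sample = [
        "A. Lee; B. Kim; C. Roy\tFirst paper title",
        "A. Lee; B. Kim\tSecond paper title",
        "D. Poe\tSolo paper title",
    ]
    print(run_job(sample))
```

With Hadoop Streaming the map and reduce steps would be split into separate scripts that read stdin and write tab-separated key/value lines. This sketch skips single-author records; whether to emit them with an empty stripe is a choice left to the team.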

Mandatory requirement: Every team must have its own data set; teams may not copy from each other.

Skills: Hadoop, Python, Map Reduce

About the Employer:
( 0 reviews ) Buffalo, United States

Project ID: #29921525

1 freelancer is bidding on average $88 for this job

msavinash1139

Hi, I have worked on several MapReduce based problems in Hadoop and would like to assist you on this project. Let me know soon if you would like to collaborate with me.

$88 USD in 7 days
(10 Reviews)
4.1