This project received 5 bids from talented freelancers with an average bid price of €2422 EUR.

Get free quotes for a project like this
Project Budget
€750 - €1500 EUR
Total Bids
Project Description

1- Scope:

Providing with a basic Hadoop Map-R V.3 environment over Amazon Web Services. Basic trial environment in this phase. No need to provide 24 x 7 tools or extra code.

Main aim is to analyse data from several text S3 input sources and start trial period.

2- Tools:

We provide Project AWS account for the Project and Map-R V.3 Hadoop clusters. Free administration for implementing this project.

3- Deliverables:

- Scripts code for AWS API based automatic MAP-R V.3 Set-up for a given number of masters and computing nodes.

- Set up scripts capable of using EC2 on “demand nodes”

o For real time 24x 7 live queries

o For batch night processes.

- Java basic code for providing basic routines like:

o Joints tables form several text sources.

o Gauss statistics: Mean, deviation, etc.

o Basic counting and basic mathematics routines.

o Output text or Mysql computed tables.

- skype sessions for 4 hours to train skilled informatics from de php and javascript world.

- Documented source code.

4- Input sources:

The project is intended for analysing and creating logs joints form distant connected devices and central text tables.

- Several TEXT files for remote devices stored on S3 files.

o Characteristics of remote devices (>[url removed, login to view] TV sets)

• Brand

• Programed parameters

• Available channels o

• Geo location

o Log text of distant

• Real time logging of visits

• Number of visits

• Duration

• TV station tuned in in each moment

• Type home demographics where the device is installed.

o TV Stations programming scheduling

• Show type: movie, talk show, debate

• Start time, end time.

• Celebrities involved in the show.

6- Expected outputs.

- Several combinations of the above.

- - Mean time per TV set type expend in each type of show.

o Mean time

o Standard deviation

o Top celebrities watched

- Samples of joints form several sources.

- Real time queries set up in case of need real time response.

- Batch set up for long time consuming queries of whole set of queries.

7- Time table.

- Needed in four weeks / January end – first September week.

- We provide AWS zone with all the text sources inside ready for use.

- Week days 9- 18h CET e-mail /skype contact for immediate support for any doubt or clarification needs.

8- References:

- No project will be awarded without clear and outstanding references on hadoop implantations over AWS ,

- MAP-R is a plus.

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online