Completed

Scrape football match stats

This project was successfully completed by lkhelladi for €122 EUR in 2 days.

Project Budget: €30 - €250 EUR
Completed In: 2 days
Total Bids: 48
Project Description

Dear Freelancers,

There is a site [url removed, login to view] where you can find past match results for several football leagues (for example, yesterday's matches: [url removed, login to view]; scroll down a bit to the section "past match results"). For every match you can click the chart icon on the right, "match stats for...". Clicking it takes you to a page like this (an example for one match): [url removed, login to view]

This is the page I want to scrape. I want most of the details shown in the 3 attached pictures (Scrape1, Scrape2, Scrape3).

From every match I would like to get 2 observations (units), which means the data should be scraped separately for each team. So let's take the 1st observation.

From "scrape1" I want to scrape:

- Team (Atlético Huila)

- Country (Colombia)

- Opponent (Rionegro Águilas)

- Competition (Primera A)

- Date (18/09/2016)

- Venue (Home) <- "Home" if the team is listed first, otherwise "Away"

- HT result_Team (0)

- HT result_Opponent (1)

- FT result_Team (1)

- FT result_Opponent (2)
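
A minimal sketch of how one match could be turned into the two observations described above. The `match` dictionary and its keys (`home`, `away`, `ht`, `ft`, and so on) are hypothetical stand-ins for whatever the scraper extracts from "scrape1"; only the output field names follow the list. It also assumes that the country belongs to the competition and is therefore the same for both teams.

```python
def match_to_observations(match):
    """Return two rows (one per team) built from a single match record.

    `match` is a hypothetical dict; the team listed first is treated as "Home".
    """
    home, away = match["home"], match["away"]
    rows = []
    # First pass: home team is "Team"; second pass: the roles are swapped.
    for team, opponent, venue in ((home, away, "Home"), (away, home, "Away")):
        rows.append({
            "Team": team["name"],
            "Country": match["country"],  # assumption: one country per match
            "Opponent": opponent["name"],
            "Competition": match["competition"],
            "Date": match["date"],
            "Venue": venue,
            "HT result_Team": team["ht"],
            "HT result_Opponent": opponent["ht"],
            "FT result_Team": team["ft"],
            "FT result_Opponent": opponent["ft"],
        })
    return rows


# The example match from the list above.
example = {
    "country": "Colombia",
    "competition": "Primera A",
    "date": "18/09/2016",
    "home": {"name": "Atlético Huila", "ht": 0, "ft": 1},
    "away": {"name": "Rionegro Águilas", "ht": 1, "ft": 2},
}
first, second = match_to_observations(example)
```

Here `first` is the Atlético Huila row (Venue "Home") and `second` is the same match seen from Rionegro Águilas (Venue "Away", with the HT and FT results swapped).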

From "scrape2" I want to scrape everything except goals, because those are already taken from "scrape1". For each of the 2 observations I would like the stats for both teams: the stats of the observed team go into the 1st set of variables (PenaltyT, BallPosessionT, AttacksT, ...), where T means Team, and the stats of the other team go into the 2nd set of variables (PenaltyO, BallPosessionO, AttacksO, ...), where O means Opponent.
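
The T/O naming could be sketched as below. The function name and the stat dictionaries are hypothetical, and the numbers are made-up placeholders, not values from the real page; only the suffix convention comes from the description.

```python
def stats_with_suffixes(team_stats, opponent_stats):
    """Merge two stat dicts into one row, suffixing keys with T (Team) or O (Opponent)."""
    row = {}
    for name, value in team_stats.items():
        row[name + "T"] = value
    for name, value in opponent_stats.items():
        row[name + "O"] = value
    return row


# Placeholder numbers for illustration only.
home_stats = {"Penalty": 0, "BallPosession": 55, "Attacks": 101}
away_stats = {"Penalty": 1, "BallPosession": 45, "Attacks": 87}

# The same two dicts are used for both observations, with the roles swapped.
obs1 = stats_with_suffixes(home_stats, away_stats)
obs2 = stats_with_suffixes(away_stats, home_stats)
```

Swapping the arguments is enough to produce the second observation, so each match page only needs to be parsed once.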

There is also the file "scrape3". The data I need from there is a little more complicated. I need to compute the number of minutes of a match that the Team spent in each score state, represented by 5 variables ("mB2" -> 2 or more goals behind, "mB1" -> 1 goal behind, "mDraw" -> draw, "mA1" -> 1 goal ahead, "mA2" -> 2 or more goals ahead). If you don't totally understand what I mean, please ask and I will explain better. I am not sure yet how this should be computed from this kind of data, but I believe it is more than possible. Minutes should be computed with the first half lasting 46 minutes and the second half 48 minutes. In the same way, I would like the minutes of a match in which the Team had 1 player more on the pitch, or 1 or more players fewer, computed as before but now from red cards, where a red card means 1 player is sent off.
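
One possible way to compute the score-state minutes, as a sketch under stated assumptions: goals arrive in chronological order as hypothetical `(half, clock_minute, scored_by_team)` tuples, the first half counts 46 minutes and the second 48 (94 in total), and a goal changes the state from its own minute onward. The goal minutes in the example are invented; only the 0-1 half-time and 1-2 full-time scoreline matches the example match.

```python
FIRST_HALF = 46   # first half length in minutes, as specified
SECOND_HALF = 48  # second half length in minutes, as specified
TOTAL = FIRST_HALF + SECOND_HALF


def elapsed(half, minute):
    """Map (half, clock minute) to minutes elapsed on the 94-minute scale."""
    if half == 1:
        return min(minute, FIRST_HALF)
    # Second-half clock starts at 45; cap stoppage time at the half's length.
    return FIRST_HALF + min(max(minute - 45, 0), SECOND_HALF)


def score_state(diff):
    """Name the state for a goal difference seen from the Team's side."""
    if diff <= -2:
        return "mB2"
    if diff == -1:
        return "mB1"
    if diff == 0:
        return "mDraw"
    if diff == 1:
        return "mA1"
    return "mA2"


def minutes_by_score_state(goals):
    """Sum the minutes spent in each state; `goals` must be in time order."""
    minutes = {"mB2": 0, "mB1": 0, "mDraw": 0, "mA1": 0, "mA2": 0}
    diff, last = 0, 0
    for half, minute, scored_by_team in goals:
        now = elapsed(half, minute)
        minutes[score_state(diff)] += now - last  # time spent in the old state
        diff += 1 if scored_by_team else -1
        last = now
    minutes[score_state(diff)] += TOTAL - last    # from last goal to full time
    return minutes


# Invented goal minutes that give 0-1 at half time and 1-2 at full time.
goals = [(1, 30, False), (2, 60, True), (2, 80, False)]
result = minutes_by_score_state(goals)
```

With these invented minutes the Team spends 50 minutes level and 44 minutes one goal behind, which adds up to 94. The same accumulation loop could be reused for red cards by tracking the player difference instead of the goal difference (a red card for the Team counting as -1, one for the opponent as +1).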

That's it. I don't know how all this scraping works, whether it can be done in R (the only program besides Excel that I know how to use) or in some other way, but the most important thing is that the scraped data ends up looking like the Excel file ("Data"), prepared for analysis. Also, it would be good if the scraping could be done for all matches at once, so that I don't need to open each of them, but if not, that is OK anyway; as I said, I don't know how it works :)

For any further questions, just message me on chat.
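
As a sketch of the final step, the rows from all matches could be collected into one list and written as CSV, which both Excel and R can open. The column layout of the "Data" file is not known here, so the function below simply takes the columns from the rows themselves; the function name and the sample rows are hypothetical.

```python
import csv
import io


def rows_to_csv(rows, out):
    """Write a list of dict rows to `out` as CSV, one column per distinct key."""
    fieldnames = []
    for row in rows:
        for key in row:
            if key not in fieldnames:
                fieldnames.append(key)  # keep first-seen column order
    writer = csv.DictWriter(out, fieldnames=fieldnames, restval="")
    writer.writeheader()
    writer.writerows(rows)


# Two observations for one match; in practice this list would hold all matches.
rows = [
    {"Team": "Atlético Huila", "Opponent": "Rionegro Águilas", "Venue": "Home"},
    {"Team": "Rionegro Águilas", "Opponent": "Atlético Huila", "Venue": "Away"},
]
buffer = io.StringIO()  # an open file object would work the same way
rows_to_csv(rows, buffer)
csv_text = buffer.getvalue()
```

Writing everything in one pass means no match page has to be opened by hand; a real file opened with `open(path, "w", newline="", encoding="utf-8")` can replace the in-memory buffer.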

Thank you!

Nejc
