Closed

Scrape Data from 5 different websites(Need to Learn Scraping data in php and python)

Basically my father has to go to 5 different websites to go watch his online videos i want to be able to grab any information from any part of a website and more importantly

grab an array of all the items then run little loops and that to extract the first second third fourth and fifth things type of thing within the bigger array to add to the arrays to be placed in for the moment file on the computer as a html file for accessing later.

Now i want someone to teach me the fine arts of web scraping so i can put together one webpage based on what i will scrape from these websites.

This is a small thing but something i need to do both for this and an upcoming website that could be worth a bit should i get it scraped in time but its better i learn scraping for python and php so i can apply this to my own websites i can in theory use php to enter data into a mysql database that stuff is easy to do if you have the data. i can even learn to hack my own wordpress theme with that data but i need to get the data before i can do any of that plus if someone knows wordpress plugin integration that would help me with my projects.

At the moment a tutorial to scrape this website for both python and php would be appreciated main site is

With python and beautiful soup i can get down to

<div class="section-programs">

<p class="episode" data-keywords="abc3">

<a style="color:#262626" href="/programs/yoohoo-and-friends/ZX6514A015S00" title="Series 1 Ep 41 Stoney Island">YooHoo And Friends</a>

<span style="color:#6f6f6f"> - 15 episodes</span>

with the following code

from BeautifulSoup import BeautifulSoup

import requests

url ="[login to view URL]"

r = [login to view URL] (url)

soup = BeautifulSoup([login to view URL])

paragraph_number = len([login to view URL]('p', attrs={"class":"episode"})) paragraph number for looping

current_paragraph = [login to view URL]('p', attrs={"class":"episode"})[0] current paragraph

php i havent tried sucessfully to pull anything but the main site [login to view URL] but this is one of 5 or so i need to scrape and the basic information is similar to above code i need the abc3 in the datakeywords as channel the href with a base url added to that and with the above data apply it like the a text YooHoo And Friends and append the title data to that so Yoo Hoo And Friends Series 1 Ep 41 i can add and change ep to episode and the like i believe but i need to grab them oh yeah and i need to grab the span tag specifically the 15 episodes this area will create the loop number for that link for how many hidden episodes of that program there are and then every link grabbed from the first list if more then 1 episodes are in that span area then the resulting links are parsed and they go in a got links area value in the links that have more episodes the links are treated the same way they get created and the hrefs with a base url are compared to already got links if they arnt in there added and finally in the programs page there are some images i need to steal aka channel images and the like then a big list is made with a <a href ="abc links" title="abc program titles">program name series episode etc</a>

all links are then put into a file then the next site.

But this is basically what i need from someone so i can scrape pages get the main page i can do that on php and python. python i can get down to an array of paragraphs with all the info i need to get php i cant even get the first element of the items i need.

So anyone who can teach me php and python scraping i would appreciate it.

David Beams

Skills: MySQL, PHP, Python, Web Scraping

See more: wordpress hack plugin, who can help me with php online, what is big oh, what i need to create site, what can you do with php and mysql, what can i do with php and mysql, website scraping projects, website div create how, web scraping python 3, web scraping programs, web scraping part time, webpage i want to create, want to create my own database, tutorial data, this and that tutorial, site to learn how to create a website, site scraping online, scraping websites with python, scraping web content, scraping python link

About the Employer:
( 0 reviews ) faridabad, India

Project ID: #9721713

6 freelancers are bidding on average $70 for this job

mantislin

Hi sir, I am scraping expert, I have did too many similar projects, please check my feedback then you will know. Can you tell me more details? then I will provide demo data for you. Thanks, Kimi

$120 AUD in 3 days
(383 Reviews)
7.7
webxtor

Hello. I am web extractor. Can teach you. Thanks . Eugene

$55 AUD in 1 day
(290 Reviews)
6.8
womki

I have plenty of experience in web scraping in php (php is much better than python for scraping html, python works better for formatted data like xml, json,etc), if you want I can create 2 simple scripts to crawl the d More

$35 AUD in 1 day
(21 Reviews)
5.4
anujayk

I am an experience web developer having good hands on latest relevant technologies used for the front end and backend development,i am experienced in doing the similar projects,i could be results a better resource for More

$25 AUD in 1 day
(9 Reviews)
2.8
ideasrefined1

i have worked on similar projects outside freelancer.

$35 AUD in 5 days
(1 Review)
2.7
jincongho

I have done many data scraping work before, such as dictionary entries and news. Accept my bid and I'll start teaching you immediately. I can give you example code how to scrape movie websites you've mentioned and expl More

$150 AUD in 2 days
(0 Reviews)
0.0