Crawl iTunes API and Insert into Elasticsearch

Closed Posted 4 years ago Paid on delivery

I currently have a script that parses the iTunes API and puts the data into Elasticsearch and Cassandra databases. It crawls the RSS feeds twice per day and also checks iTunes for new podcasts every day. Here is an example of an RSS feed that it parses:

[login to view URL]

Podcasts are like audio shows, and each podcast has multiple episodes. In other words, each podcast has one RSS feed, and each RSS feed lists the episodes for that podcast sorted by newest release date first.
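
To make the feed structure concrete, here is a minimal Python sketch of reading one feed and listing its episodes newest first. It is not the existing script. The sample feed, the `parse_episodes` name, and the choice to skip episodes with an unreadable date are all assumptions for illustration, and it uses only the standard library.

```python
import xml.etree.ElementTree as ET
from datetime import timezone
from email.utils import parsedate_to_datetime

# Hypothetical sample feed for illustration; real feeds carry many more fields.
SAMPLE_FEED = """<rss version="2.0">
  <channel>
    <title>Example Show</title>
    <item>
      <title>Episode 2</title>
      <pubDate>Tue, 10 Sep 2019 08:00:00 +0000</pubDate>
    </item>
    <item>
      <title>Episode 1</title>
      <pubDate>Tue, 03 Sep 2019 08:00:00 +0000</pubDate>
    </item>
  </channel>
</rss>"""


def parse_episodes(feed_xml):
    """Return (podcast_title, episodes), with episodes sorted newest first."""
    channel = ET.fromstring(feed_xml).find("channel")
    if channel is None:
        raise ValueError("feed has no <channel> element")
    episodes = []
    for item in channel.findall("item"):
        raw_date = item.findtext("pubDate")
        try:
            published = parsedate_to_datetime(raw_date)
        except (TypeError, ValueError):
            continue  # assumption: skip episodes whose date is missing or malformed
        if published.tzinfo is None:
            # Assumption: treat dates without a usable offset as UTC.
            published = published.replace(tzinfo=timezone.utc)
        episodes.append({"title": item.findtext("title"), "published": published})
    # Feeds are usually newest first already; sort anyway to be safe.
    episodes.sort(key=lambda ep: ep["published"], reverse=True)
    return channel.findtext("title"), episodes
```

Calling `parse_episodes(SAMPLE_FEED)` returns the podcast title together with a list of episode dictionaries, each holding a `title` and a timezone-aware `published` datetime.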

The current developer of the script is not very responsive to making changes, so your job is to:

1 - Fix the parse errors that occur for some of the podcast RSS feeds.

2 - Add the many podcasts we are missing from iTunes. We can get some of those from another website's API.

3 - Set up data for each podcast on how often it releases new episodes. We can determine the frequency just by looking at the RSS feed, and store it in the database. For example, podcasts that release once per day or multiple times per day should be crawled every hour. Those that release once per week should be crawled maybe 4 times per day, and so on.
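
As a rough sketch of item 3, the release frequency could be estimated from the gaps between episode publish dates and then mapped to a crawl rate. The function names, the use of the median gap, and the exact thresholds (1.5 and 7 days) are assumptions for illustration. Only the "every hour" and "maybe 4 times per day" targets come from the description above, and the fallback of 2 crawls per day mirrors the current twice-daily schedule.

```python
from datetime import datetime, timedelta
from statistics import median


def median_gap_days(pub_dates):
    """Median number of days between consecutive episodes, or None if unknown."""
    ordered = sorted(pub_dates, reverse=True)
    gaps = [
        (newer - older).total_seconds() / 86400
        for newer, older in zip(ordered, ordered[1:])
    ]
    return median(gaps) if gaps else None


def crawls_per_day(gap_days):
    """Map a release gap to a crawl rate (thresholds are assumptions)."""
    if gap_days is None:
        return 2   # not enough episodes: keep the current twice-daily crawl
    if gap_days <= 1.5:
        return 24  # daily or more often: crawl every hour
    if gap_days <= 7:
        return 4   # roughly weekly: crawl 4 times per day
    return 2       # less frequent: keep the current twice-daily crawl


# Usage with made-up publish dates
base = datetime(2019, 9, 10, 8, 0)
daily_show = [base - timedelta(days=i) for i in range(5)]
weekly_show = [base - timedelta(weeks=i) for i in range(5)]
print(crawls_per_day(median_gap_days(daily_show)))
print(crawls_per_day(median_gap_days(weekly_show)))
```

The median is used rather than the mean so that one long break between episodes does not change the estimate much. The resulting value could be stored with each podcast and refreshed on every crawl.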

I will give you the code so you can understand it, and you can also talk with one of my other engineers who knows how it works.

The code is written in Python. You must also show me expertise in Elasticsearch.

Thank you

Python Node.js Elasticsearch Cassandra Podcasting

Project ID: #20952504

About the project

9 proposals Remote project Active 4 years ago

9 freelancers are bidding on average $367 for this job

zekovicm

Hi there, I am a web scraping expert from Bosnia & Herzegovina, Europe. I have carefully gone through your requirements and I would like to help you with this project! I can start immediately and finish it within the ...

$322 USD in 7 days
(29 Reviews)
6.1
liveexperts123

Hi there, I have read your project description and I'm confident I can do this project for you perfectly. I still have a few questions. Please leave a message on my chat so we can discuss the budget and deadline of the ...

$400 USD in 3 days
(16 Reviews)
5.9
whiteeagle0001

Hello, how are you? I have read your description in detail and am very interested in your project. I think that I can finish your work perfectly as you need. I have a lot of experience with Node.js. If you need to ...

$400 USD in 7 days
(20 Reviews)
4.6
zeke

I have lots of experience writing web automation scripts using Scrapy, and with Elasticsearch too. Available to start immediately and finish as soon as possible. Please contact me to discuss details if you are interested.

$250 USD in 7 days
(13 Reviews)
4.9
umairkaramat24

Hello there. How are you doing? I have read the description, and I have great experience doing similar jobs related to these skills: Cassandra, Elasticsearch, Node.js, Podcasting, Python. Please start the chat so we can hav...

$280 USD in 13 days
(6 Reviews)
3.3
sharktiger

Good day! I'm a licensed full stack programming developer and designer. I have a lot of experience in Python/Django, Python Selenium web scraping, and Python image processing using the OpenCV package. I have many ...

$250 USD in 7 days
(2 Reviews)
3.2
jaymaninfotech2

JAYMAN INFOTECH PVT LTD is a contemporary website design and development company with a focus on user-centered design while helping our clients achieve the desired result. We are a custom software development company ...

$850 USD in 45 days
(0 Reviews)
0.0