Fix a half-done fun music program (only one or two php needs fixing) - repost

IN PROGRESS
Bids
2
Avg Bid (USD)
$43
Project Budget (USD)
$30 - $100

Project Description:
Put "Eminem" for example in the box at

http://codemaker.eunetadsl.com/freelance/mujyhane/index.php

you can see what this program does.

This program searches Youtube API for video titles and then extract information.

I will send you the program once you post a message.

The program extracts song, artist, featuring artist info from youtube titles.

But as you can see, there are many errors.

I hope you do as much as possible to clean the result.

But at least the following should be done.


1. I will make my own artists.xlsx. (You don't have to make it.)

See the attachment to this project description.

Once I click the start button, the program will start with the first artist of artists.xlsx,

and it all things are complete, then it will start extracting info about the second artist of artists.xlsx.


2. If the title contains more than one " - ", just don't include that title whatsoever.

Don't even take that into account.

Note that it is OK to contain multiple "-", but not OK to contain multiple " - ".

The current program already ignores all video titles that do NOT have " - " in the title. That's good.

But if there are more than one " - " in the title, it should be ignored.

But including "-" many times is OK.

"-" and " - " are different.

So,

title without any " - " in it => ignore
title with only one " - " => include
title with more than one " - " => ignore
title with one " - " and multiple "-" => include

3.
Eminem , Skylar Grey Lyrics (Wrong. the space)
Eminem, Skylar Grey Lyrics (correct)


4. The current version can't get the featuring artist info from the following title.

Fort Minor - Remember the Name (Ft. Eminem and 50 Cent)

Perhaps this is because it contains "Ft" rather than "ft".


5. If there is parenthesis like () [] {} then just delete the parenthesis and also delete everything in that parenthesis.

But if in the parenthesis there is artist name, then just ignore that video title and don't put that into excel.

EXCEPTION: I said the program should use artists.xlsx to know what artist to search for.

Suppose the first artist of artists.xlsx is "Eminem".

Title includes (Eminem) => that video title should not be considered in the algorithm at all.
Title include (lyrics) => do not exclude that video title but delete "(lyrics)" in the final output of song, artist, featuring artist.

And afther Eminem is complete, suppose the second artist of artists.xlsx is "Rihanna"

Title includes [Rihanna] => that video title should not be considered in the algorithm at all.
Title include (lyrics) => do not exclude that video title but delete "(lyrics)" in the final output of song, artist, featuring artist.


6. You see the program makes many "lines" with song, artist, featuring artist, and original

title.

If the program puts "Rihanna" and "Dr. Dre" in the song column,

then it means it is wrong because "Rihanna" and "Dr. Dre" are artists, not songs.

To fix this, I made artistsfix.xlsx.

In the example artistsfix.xlsx that I attached, there are Rihanna and Dr. Dre.

So if any of the line contains Rihanna or Dr. Dre, the line should be deleted.


7. Keep in mind that the purpose of the program is extracting song, artist, featuring artist info from vid titles.

I think definitely it is very very very easy to find what's wrong if you have basic common sense.

(It is very easy to see that an artist name should not contain words like "(lyrics)".)

Skills required:
PHP, Software Architecture, Web Scraping
Additional Files: artists.xlsx
About the employer:
Verified
Public Clarification Board
Bids are hidden by the project creator. Log in as the employer to view bids or to bid on this project.
You will not be able to bid on this project if you are not qualified in one of the job categories. To see your qualifications click here.


Hire gmccann
$ 35
in 2 days
Hire kairuediger
$ 50
in 3 days