Cancelled

Wikipedia Scraper

I need a small standalone desktop application to scrape information from Wikipedia. To apply for this job you should have a strong experience with Wikipedia data mining.

The software should do the following:

1. The software should have an URL input box. the user should be able to copy and paste several Wikipedia URLs into the input box. The software will go to wikipedia and extract the text and images from Wikipedia.

2, It will save the article content from a single URL and images into a Word document and save. It will do this for all the urls separately.

3. Scrape the databox on the right hand side of the page. see [url removed, login to view] and check out the "Great White Pelican" databox on the right. I need to have the information saved to a table and added tot the scrape text.

4. It will save all the images from each URL to a separate folders and named each folder with the URL title where the images came from.

5. Search and remove all Wikipedia internal link numbers like [12].

After this project. I will do another project that will combine this information into a database that is easy to search which will be used for an app.

feel free to suggest the best method to do this.

happy bidding.

Skills: C Programming, PHP, Software Architecture

See more: need wikipedia, need wikipedia page, programming wiki, job wikipedia, tot, php wikipedia, pelican, programming data mining, standalone desktop app, mining text data, single page app php, app check content, extract images url, extract app data, easy wikipedia, images word app, column added table, save images word, extract search box, combine link, separate chaining hash table, small data mining project, software project data mining, standalone php application, text wikipedia

About the Employer:
( 18 reviews ) Scherpenheuvel, Ireland

Project ID: #4683441

6 freelancers are bidding on average $465 for this job

miniric3

Kia ora! On2itonline.com are a NZ based web and software design company who have seen you here on the freelance market and are really excited about working with you and treating you to the full service, professional ex More

$721 USD in 21 days
(5 Reviews)
4.9
KrazyKoder

Hi, I'm interested and can do this job. Thanks

$315 USD in 5 days
(17 Reviews)
4.7
Blackhatwarrior

Please check PMB !

$250 USD in 3 days
(8 Reviews)
3.7
johan777

i am interested., please check pm., thanks

$350 USD in 10 days
(1 Review)
1.8
shiyaem

Hi I can do this project. Please see PMB

$555 USD in 3 days
(0 Reviews)
0.0
yihsan

Let's start....

$700 USD in 21 days
(0 Reviews)
0.0
SolidCoding

We have a solid experience with Web Scraping and Data Mining, with previous experience in similar projects. We can absolutely deliver this Scraper with the required specs. Please check our PM. Cheers.

$250 USD in 4 days
(0 Reviews)
0.0
vikasglobus13

We have wikipedia scraper ready for you. See This : http://www.botguruz.com/wikipedia-data-extractor , please check PMB for more details.

$333 USD in 3 days
(0 Reviews)
0.0