Closed

Parse Wikipedia Database Dump

This project was awarded to marchent for $100 USD.

Project Budget: $100 - $300 USD
Total Bids: 3
Project Description

Wikipedia Database Dump project:

1. Parse [url removed, login to view] files and extract only unique domain names. Domains must not be wikipedia.org.
2. The script should work with a file from [url removed, login to view]
3. Two params: filename -> [url removed, login to view], and database settings -> SQL table to insert the URLs into (id, domain).
All params should be at the beginning of the file so we can customize them ourselves.
4. The software should run on Linux and use regex or another parsing technique.
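
The four requirements above could be sketched roughly as follows. This is a minimal illustration, not the awarded solution. The posting elides the dump filename and does not name the database, so DUMP_FILE, DB_PATH and TABLE are hypothetical placeholders, and SQLite (from the Python standard library) stands in for whatever SQL server the employer uses. The regex only matches http(s) URLs and treats any host ending in wikipedia.org as excluded; both are assumptions.

```python
import gzip
import os
import re
import sqlite3

# --- Parameters, kept at the top of the file so they can be customized ---
DUMP_FILE = "dump.sql.gz"    # hypothetical name; the real filename is elided in the posting
DB_PATH = "domains.db"       # assumption: SQLite stands in for the unspecified database
TABLE = "domains"            # hypothetical table with columns (id, domain)
EXCLUDED = "wikipedia.org"   # hosts equal to or under this domain are skipped

# Captures the host part of http(s) URLs as group 1.
URL_HOST_RE = re.compile(r"https?://([A-Za-z0-9.-]+)", re.IGNORECASE)


def open_dump(path):
    """Open a plain or gzip-compressed dump as text."""
    opener = gzip.open if path.endswith(".gz") else open
    return opener(path, "rt", encoding="utf-8", errors="replace")


def iter_domains(lines):
    """Yield each non-excluded host once, in first-seen order."""
    seen = set()
    for line in lines:
        for host in URL_HOST_RE.findall(line):
            host = host.lower().strip(".")
            if not host or host == EXCLUDED or host.endswith("." + EXCLUDED):
                continue
            if host not in seen:
                seen.add(host)
                yield host


def store_domains(domains, db_path=DB_PATH, table=TABLE):
    """Insert domains into (id, domain); the UNIQUE constraint skips repeats."""
    conn = sqlite3.connect(db_path)
    with conn:  # commits on success, rolls back on error
        conn.execute(
            f"CREATE TABLE IF NOT EXISTS {table} "
            "(id INTEGER PRIMARY KEY AUTOINCREMENT, domain TEXT UNIQUE)"
        )
        conn.executemany(
            f"INSERT OR IGNORE INTO {table} (domain) VALUES (?)",
            ((d,) for d in domains),
        )
    conn.close()


if __name__ == "__main__":
    if os.path.exists(DUMP_FILE):
        with open_dump(DUMP_FILE) as fh:
            store_domains(iter_domains(fh))
    else:
        print(f"Dump file not found: {DUMP_FILE}")
```

The dump is read line by line rather than loaded whole, but the set of seen hosts is held in memory. For a very large dump it may be preferable to drop the set and rely on the UNIQUE constraint alone.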



Example Domain Extraction:

[url removed, login to view] -> extract [url removed, login to view]
[url removed, login to view] -> extract [url removed, login to view]
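
Since the example URLs above are elided, the exact output format the employer expects (full host name, or host without a leading "www.") is not recoverable from the posting. The sketch below is one possible reading: it returns the lowercase host of a URL using urllib.parse instead of a regex. The function name extract_domain and the strip_www switch are hypothetical.

```python
from urllib.parse import urlsplit


def extract_domain(url, strip_www=False):
    """Return the lowercase host of a URL, or None if no host can be found."""
    # urlsplit only recognizes a host when the URL has a scheme or starts with "//",
    # so a default scheme is added for protocol-relative and bare inputs.
    if url.startswith("//"):
        url = "http:" + url
    elif "://" not in url:
        url = "http://" + url
    try:
        host = urlsplit(url).hostname  # already lowercased, port removed
    except ValueError:
        return None  # malformed URL, e.g. an unbalanced IPv6 bracket
    if not host:
        return None
    host = host.rstrip(".")
    # Assumption: stripping "www." is optional because the posting's examples are elided.
    if strip_www and host.startswith("www."):
        host = host[4:]
    return host or None
```

Called on a made-up URL such as "http://www.Example.com:8080/page?x=1", this returns "www.example.com", or "example.com" with strip_www=True.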
