Retrieval data from website by script (SP20100918)

In Progress

** Brief description

Retrieve all companies information/addresses from a web dictonary (exhibition) and store them in a SQL database

for future processing in a marketing departement.

** Decription

Retrieve (automatically, programly, using a program) and fill-in

- SQL table company (addresses, type of business and type of companies)

- SQL table contactpersons,

from a exhibition website and store these data into a SQL database (mysql). A basic draft of the database schema is defined below.

To avail mistakes and also because of the amount of companies

(about 4k to 6k), the data retrieval should be done using a small, selfwritten program/script.

This program/script is also part of the project.

** Task 1:

Create inital database by retrieving

all companies and contact persons from the 12 "display categories" on

[url removed, login to view] (linked from linked from [url removed, login to view])

like "ICT Infrastructure", "Business IT", ..

Sample: categories "ICT Infrastructure": 594 companies

This task will fill-in the SQL-table "Company" and "ContactPerson"

Do avoid duplicates of companies. Be carefull - companies could be

in more than one categories.

** Task 2:

Qualify all retrieved companies by "Section" and "ProductCategory"

using the qualification links

ProductCategory # [url removed, login to view]

Section # [url removed, login to view]

Therefore, setup initialize first the 2 qualification tables

ProductCategory

Section

manual (?) and than set the relating ProductCategory and Section

to each company in the database:

company.PC_ID # foreign key to [url removed, login to view] with the higest level

company.SI_ID # foreign key to [url removed, login to view] with the higest level

** Project output:

- SQL-Dump with all companies, related information and qualifications like defined in the tasks.

- Program doing the retrieval of data from the website.

including a small README/HOWTO how to use this script.

** Draft Database schema

TABLE company # source: [url removed, login to view] (go throught 12 display categories)

ID

# main information

name1

name2

address1

address2

postcal code

city

country

Telephone1

Fax1

website

subject

# PertinentFacts

Type of company

AnnualTurnover

AnnualTurnoverDate

Employees

EmployeesDate

# location cebit2010

hall:

Stand:

PC_ID # foreign key to [url removed, login to view] with the higest level

SI_ID # foreign key to [url removed, login to view] with the higest level

TABLE ContactPerson # source: [url removed, login to view] (go throught 12 display categories)

ID

Name

Type

Telephone

Fax

Email

Mobile

companyId # foreign key to [url removed, login to view]

TABLE ProductCategory # source: [url removed, login to view]

ID

Name

ParentPC_ID # foreign key to [url removed, login to view], NULL if it's the root ProductCategory (no parent)

TABLE Section # source: [url removed, login to view]

ID

Name

ParentSI_ID # foreign key to [url removed, login to view], NULL if it's the root section (no parent)

Skills: Data Entry, Data Processing, Perl, PHP, Shell Script

See more: website by, telephone marketing companies, set data, php script null, one key data, how to use code to create a website, how to null php script, how to null a script, how to null a php script, how to create website on mobile, how to create website company, how to create website by php, how to create small website, how to create company website, how to create a website with database, howto create a website, foreign website company, website marketing companies, ict website, data processing website, telephone script, sql duplicates, sql database website, null, Mobile datA

About the Employer:
( 39 reviews ) Stuttgart, Germany

Project ID: #799285

Awarded to:

zeke

Please see PMB for details.

$200 USD in 1 day
(179 Reviews)
7.1

6 freelancers are bidding on average $197 for this job

srinichal

I am an expert in scrapping and can deliver the project

$180 USD in 4 days
(129 Reviews)
7.3
liviakecskes

Hello! Please check PM.

$200 USD in 5 days
(46 Reviews)
5.9
PauloSam

Hello, I can do this. Please check PM.

$150 USD in 3 days
(44 Reviews)
5.4
noobraga

Please Check PM.

$200 USD in 2 days
(3 Reviews)
3.0
benny33

check pm please, sir

$250 USD in 1 day
(0 Reviews)
0.0