web page parsing and download

Completed Posted Mar 21, 2009 Paid on delivery
Completed Paid on delivery

I require a solution that will parse a webpage and determine all the associated data as it relates to multi-level list box.

When you first arrive at the webpage ( [login to view URL]

), there is a list box that contains about 20 categories listed. When you click on a category, you get a list (within the same list box) of a number of sub-categories. This system continues to additional sub-categories within sub-categories, 4 or 5 levels deep.

I think the proper term for the structure is a tree with nodes, or similar. By looking at the page source I can see that it is driven by javascript (and some have suggested Ajax is used), and although there are linkable strings, the links do not show in the page source code. I need the keyword data associated with each category in the tree structure.

The process is simple enough when done manually, but very time consuming.

## Deliverables

the webpage I need parsed is

[login to view URL]

I have attached a sample of the "tree" showing several levels.

* * *This broadcast message was sent to all bidders on Saturday Mar 21, 2009 4:08:43 PM:

Note to all who have replied so far ... Now that several bidders have asked questions, I find myself confused/stumped by the style used by Google on that webpage. I am starting to wonder if the only easy way to do this task is via the Google API. My ultimate goal is to get all the keyword data. I had thought to do it myself, as I already have some generic webpage parsing tools in PHP. However, that list box threw a curve at me and I could not see how to programmatically get at the links. To add to my problems, extracted links from the tree structure do NOT work in a standard browser's address bar !!! So I will need to ... (a) find one of you who can extract the actual keyword data (by sub-category; and I think there are actually over 500 distinct categories) or (b) get an API licence and have it developed for me that way, or (c) cancel the idea and purchase the raw data from someone. I apologize for the confusion. I have done many extractions before today (some even from Google) but this one is defying me. If you think you can do this expanded version of the job, let me know, otherwise I will have to cancel the request. Richard

PHP

Project ID: #3746704

About the project

6 proposals Remote project Active Mar 24, 2009

Awarded to:

surfingtonio

See private message.

$127.5 USD in 14 days
(95 Reviews)
5.5

6 freelancers are bidding on average $81 for this job

webexpert78

See private message.

$68 USD in 14 days
(96 Reviews)
6.1
hoesoftware

See private message.

$80.75 USD in 14 days
(62 Reviews)
5.9
keavw

See private message.

$42.5 USD in 14 days
(30 Reviews)
5.2
arun008vw

See private message.

$85 USD in 14 days
(2 Reviews)
1.3
hemangrana

See private message.

$85 USD in 14 days
(0 Reviews)
0.0