Closed

Crawling a site

This project received 13 bids from talented freelancers with an average bid price of ₹3990 INR.

Get free quotes for a project like this
Employer working
Skills Required
Project Budget
₹1500 - ₹2000 INR
Total Bids
13
Project Description

I want to crawl and fetch information(Like Address,Phone number etc) about colleges from AICTE Website.
Specifically, These colleges are 2012-13 approved colleges and AICTE assigned unique application number to each college. i have around 20k colleges Application ids.

Below link is the place where users to search colleges and It works only in IE. Seems this is coded with Siebel Web Engine.

[url removed, login to view]+12-13+Public+Domain+New+Search+View

Sample Application ids
1-699035181
1-699035461

My Requirement is, Crawl the site for 20k Colleges application ids and generate HTML File for each college. So that i can parse that HTML File and will fetch details. I am looking only Crawling Part. If you develop a program that works for two Ids that's sufficient. You can use any Language to crawl the pages.

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online