REAL ESTATE LISTING DATA SCRAPER

Closed Posted Aug 14, 2012 Paid on delivery

Our firm needs a qualified web scraping developer to build a system that validates addresses and MLS statuses for homes located in the United States and listed through various real estate agencies. The system should function as a web application, hosted at a location of our choice, with a secure login and password validation system.
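As a rough illustration of the password validation requirement, the sketch below stores a salted PBKDF2 hash instead of the password itself. Python is used only for illustration; the function names and the iteration count are assumptions, not part of the brief.

```python
import hashlib
import hmac
import os
from typing import Optional, Tuple

# Assumed work factor; tune it for the hardware the application is hosted on.
ITERATIONS = 200_000


def hash_password(password: str, salt: Optional[bytes] = None) -> Tuple[bytes, bytes]:
    """Return (salt, derived_key); a fresh random salt is drawn when none is given."""
    if salt is None:
        salt = os.urandom(16)
    key = hashlib.pbkdf2_hmac("sha256", password.encode("utf-8"), salt, ITERATIONS)
    return salt, key


def verify_password(password: str, salt: bytes, expected_key: bytes) -> bool:
    """Re-derive the key from the stored salt and compare in constant time."""
    _, key = hash_password(password, salt)
    return hmac.compare_digest(key, expected_key)
```

Only the salt and derived key would be stored per staff account; the login form then calls verify_password with the submitted password.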

STEP 1. UPLOAD FILE THAT NEEDS VALIDATION:

Developer will build a mechanism that allows one of our staff members to upload a list of the current housing inventory we wish to scrub. The data set will be a CSV file with the following columns:

FIRST, LAST, ADDRESS, CITY, STATE/REGION, POSTAL CODE
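A minimal sketch of how the uploaded file could be read and checked, assuming the first row of the CSV is a header naming the six columns above (the brief does not say whether a header row is present). The function name parse_inventory is hypothetical.

```python
import csv
import io
from typing import Dict, List

EXPECTED_COLUMNS = ["FIRST", "LAST", "ADDRESS", "CITY", "STATE/REGION", "POSTAL CODE"]


def parse_inventory(csv_text: str) -> List[Dict[str, str]]:
    """Parse the uploaded CSV text into one dict per property, keyed by column name."""
    # skipinitialspace tolerates "FIRST, LAST" style spacing after each comma.
    reader = csv.reader(io.StringIO(csv_text), skipinitialspace=True)
    rows = [row for row in reader if any(cell.strip() for cell in row)]  # drop blank lines
    if not rows:
        return []
    # Assumption: the first non-blank row is the header.
    header = [cell.strip().upper() for cell in rows[0]]
    if header != EXPECTED_COLUMNS:
        raise ValueError(f"unexpected header: {header}")
    records = []
    for index, row in enumerate(rows[1:], start=1):
        if len(row) != len(EXPECTED_COLUMNS):
            raise ValueError(f"data row {index} has {len(row)} fields, expected 6")
        records.append({col: cell.strip() for col, cell in zip(EXPECTED_COLUMNS, row)})
    return records
```

Rejecting malformed rows at upload time keeps bad addresses from reaching the validation step.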

STEP 2. VALIDATION PROCESS USING DATA ON [url removed, login to view], [url removed, login to view] AND [url removed, login to view]:

The web application should take the uploaded data set and automatically scrape publicly available websites to validate the addresses and determine the following:

1. IF THE PROPERTY IS FOR SALE ([url removed, login to view] AND [url removed, login to view]). STATUS IS LISTED ON TRULIA BUT THE DATA IS DELAYED; [url removed, login to view] IS A BETTER RESOURCE.

2. THE CURRENT LIST PRICE ([url removed, login to view]) IF LISTED

3. THE CURRENT LISTING AGENT AND REAL ESTATE COMPANY ([url removed, login to view]) IF LISTED

4. THE MOST RECENT ACTIVITY ON THE PRICE HISTORY CHART. WE ONLY NEED THE MOST RECENT “DESCRIPTION” AND “SOURCE” ([url removed, login to view]). AN EXAMPLE IS HERE: [url removed, login to view]

5. IF THE MOST RECENT ACTIVITY IS “SOLD” THEN WE NEED TO FLAG THIS HOME. USE A SPECIAL FLAG ICON AND CREATE A PAGE TO ACCESS THE FLAGGED HOMES. EXAMPLES OF SOLD HOMES ARE AVAILABLE HERE: [url removed, login to view] (QUICKEST TO UPDATE) OR [url removed, login to view] AS “RECENTLY SOLD”
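Items 4 and 5 can be sketched as follows, assuming the price history has already been scraped into a list of dated events. The PriceEvent structure, the function names, the exact match on "SOLD", and the MM/DD/YYYY date format are all assumptions for illustration; how the events are obtained from each site is not covered here.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional


@dataclass
class PriceEvent:
    """One row of a price history chart (hypothetical structure)."""
    when: date
    description: str
    source: str


def latest_event(history: List[PriceEvent]) -> Optional[PriceEvent]:
    """Return the most recent event, or None when the history is empty."""
    return max(history, key=lambda event: event.when, default=None)


def is_recently_sold(history: List[PriceEvent]) -> bool:
    """Flag the home when its most recent activity is described as SOLD."""
    event = latest_event(history)
    return event is not None and event.description.strip().upper() == "SOLD"


def summarize(event: Optional[PriceEvent]) -> str:
    """Format the PRICE HISTORY cell; empty when no history was found."""
    if event is None:
        return ""
    return f"{event.when:%m/%d/%Y} - {event.description} - {event.source}"
```

Only the latest event is kept because the brief asks for the most recent “DESCRIPTION” and “SOURCE” and nothing older.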

STEP 3. CREATE REPORT WITH NEW FIELDS FOR EXPORT/DISPLAY:

THE FOLLOWING COLUMNS WILL BE ADDED TO OUR ORIGINAL DATA SET:

FOR SALE | LISTING AGENT | LISTING COMPANY | LIST PRICE | PRICE HISTORY | RECENTLY SOLD
Y | TOM WATSON | COLDWELL BANKER | 654,000 | 06/11/2009 - LISTED FOR SALE - SOURCE | N
N | NA | NA | NA | | N
Y | BILL SMITH | SMITH REALTORS | 148,000 | | N

• IF THE PROPERTY IS “RECENTLY SOLD”, THE COLUMN NEEDS TO SHOW A RED “Y” ICON THAT LINKS DIRECTLY TO THE [url removed, login to view] PAGE SHOWING THE SOLD LISTING.
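For the on-screen report, the RECENTLY SOLD cell might be rendered along these lines. This is a sketch only: the function name, the inline styling, and the listing_url parameter (the address of the sold listing found during validation) are assumptions.

```python
from html import escape


def recently_sold_cell(sold: bool, listing_url: str = "") -> str:
    """Return the HTML for the RECENTLY SOLD column of one report row."""
    if not sold:
        return "N"
    # Escape the URL so characters such as & and " cannot break the attribute.
    href = escape(listing_url, quote=True)
    return f'<a href="{href}" style="color:red;font-weight:bold">Y</a>'
```

In the CSV export the same column would carry a plain Y or N, since a CSV cell cannot hold an icon.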

DATA SHOULD EXPORT TO A NEW CSV FILE THAT CAN BE OPENED IN EXCEL FOR STAFF REVIEW
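The export step could look roughly like the sketch below, which appends the six new columns to the original six and lets the csv module quote values such as 654,000 so they open cleanly in Excel. The function name export_report and the choice to leave missing values blank are assumptions.

```python
import csv
import io
from typing import Dict, List

ORIGINAL_COLUMNS = ["FIRST", "LAST", "ADDRESS", "CITY", "STATE/REGION", "POSTAL CODE"]
ADDED_COLUMNS = ["FOR SALE", "LISTING AGENT", "LISTING COMPANY",
                 "LIST PRICE", "PRICE HISTORY", "RECENTLY SOLD"]


def export_report(records: List[Dict[str, str]]) -> str:
    """Serialize validated records to CSV text with the original and added columns."""
    buffer = io.StringIO(newline="")
    writer = csv.DictWriter(
        buffer,
        fieldnames=ORIGINAL_COLUMNS + ADDED_COLUMNS,
        restval="",             # assumption: fields missing from a record stay blank
        extrasaction="ignore",  # drop any working fields not meant for the report
    )
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()
```

The returned text can be written to a file or sent as a download response by the web application.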

AJAX, JavaScript, MySQL, PHP, Software Architecture

Project ID: #2404662

About the project

3 proposals Remote project Active Sep 25, 2012

3 freelancers are bidding on average $183 for this job

easydevelop

Hi. We have extensive experience with real estate sites and sites for agents. We have built dozens of sites on a RETS/IDX base, especially with Trulia and Zillow, and we already have tools to work with these services. Re...

$200 USD in 2 days
(6 Reviews)
6.3
sashamd

Web scraping experience, timely delivery. Please check PMB.

$225 USD in 9 days
(13 Reviews)
5.0
survey9706

Hello TampaCL! I am an intermediate C# programmer, but I have completed several web scraping and ASP.NET jobs, and I am confident that I could complete this job without much trouble. I am looking to increase the number...

$125 USD in 8 days
(0 Reviews)
0.0