Hire Freelancers

Craigslist scraper and parser

$250-750 USD

Cancelled

Posted

almost 15 years ago

$250-750 USD

Paid on delivery

We need a Craigslist scraper and parser with (source code; preferably python) that automatically archives multiple RSS feeds from Craigslist. Running the parsed on the scraped logfile should provide word usage frequency based on gender of poster (extracted from the w4m or m4w header), city, day of posting. The program should allow the user to choose a range of dates (extracted from the timestamps) to pull the statistics from. Outputs: (1) XML files with the archived feeds for each city and craigslist category (2) Daily CSV files listing 100 most frequent words categorized by each gender , city, and age group (excluding articles and common modifiers like "a" "an" "the" "for" etc). An example text file would look like this: header: 07-01-2009,female, atlanta, 20-25 love,112 passion,93 independent,56 caring, 46 ...... ....... CSV files should also be generated for all cities, and all age groups. Headers for these files would look like this header: 07-01-2009,female, all, 20-25 header: 07-01-2009, female, miami, all header: 07-01-2009, female, all, all

Data Processing

Project ID: 460955

About the project

1 proposal

Remote project

Active 15 yrs ago

Looking to make some money?

Email address

Benefits of bidding on Freelancer

Set your budget and timeframe

Get paid for your work

Outline your proposal

It's free to sign up and bid on jobs

1 freelancer is bidding on average $500 USD for this job

Please check my PM for details.

$500 USD in 7 days

5.0

(4 reviews)

3.6

3.6

Post a project like this

About the client

Boston, India

5.0

27

Payment method verified

Member since Jul 1, 2009

Client Verification

Other jobs from this client

Simple Piggy Project For Ruben

Convert our elegant PSD design to HTML (one page)

Illustrate a pool side scene for an ad

Business Card for A Consulting Firm

Data entry to catalog products

$2-8 USD / hour

Similar jobs

WEB SCRAPE - INSOLVENCY REGISTER UK

£10-15 GBP / hour

Python Script for Social Media Platform Data

₹1500-12500 INR

Reporting using S3, Lambda, Snowflake

Help with a Coding Assessment: Python, Java, JavaScript

$15-25 USD / hour

ESP32 enviar datos a un servidor

Noise Cancelling Headphone

WEB SCRAPE - INSOLVENCY REGISTER UK

£10-15 GBP / hour

Whatsapp Chatbot Development

Embedded Systems Experts

Chrome Extension with AI Automation

Google Docs to Editable Word Conversion

₹750-1250 INR / hour

Python Developer for ETL & Data Automation

$15-25 USD / hour

Helmet & Mask Detection System for ATMs

₹12500-37500 INR

Advanced SQL & Data Science Content Creation - 30/04/2024 19:58 EDT

€12-18 EUR / hour

Looking For make predictive and descriptive and explorative elements with tableau or Power BI -- 2

Easy PDF to Editable Word Conversion

₹750-1250 INR / hour

scraping data in python $25 -- 2

Excel Accounts Receivables Spreadsheet Design

Comprehensive Data Entry into Excel

₹12500-37500 INR

Self-Learning Automation Software Developer Needed

₹12500-37500 INR

Post a project like this

Thanks! We’ve emailed you a link to claim your free credit.

Something went wrong while sending your email. Please try again.

Loading preview

Permission granted for Geolocation.

Your login session has expired and you have been logged out. Please log in again.