Closed

Java Scraping - Extracting Content From The Page

This project received 6 bids from talented freelancers with an average bid price of $225 USD.

Project Budget: N/A
Total Bids: 6
Project Description

We are looking for a proof-of-concept (PoC) app that shows how to extract the main content from an arbitrary HTML page, stripping out everything else (navigation, banners, sidebars, etc.).

This is similar to what Instapaper does with an arbitrary content page.

I have attached a list of random HTML pages covering a similar topic; the resulting application should intelligently extract only the main content from each page.
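
To make the requirement concrete, here is a minimal sketch of one possible direction, not a definitive implementation. It assumes a simple text-density heuristic: split the page into blocks at block-level tags, then keep only blocks that contain enough words and are not dominated by link text. The class name `MainContentSketch`, the method `extract`, and both thresholds are hypothetical choices made for illustration; a real solution would likely use a proper HTML parser instead of regular expressions and would also decode HTML entities.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class MainContentSketch {
    // Illustrative thresholds (assumptions, not tuned on real pages).
    static final int MIN_WORDS = 10;
    static final double MAX_LINK_DENSITY = 0.3;

    // Scripts, styles and comments never carry readable content.
    private static final Pattern NON_CONTENT = Pattern.compile(
        "(?is)<(script|style|noscript)\\b[^>]*>.*?</\\1\\s*>|<!--.*?-->");
    // Tags treated as boundaries between text blocks.
    private static final Pattern BLOCK_BOUNDARY = Pattern.compile(
        "(?i)</?(p|div|td|tr|table|li|ul|ol|h[1-6]|br|section|article"
        + "|nav|header|footer|aside|body)\\b[^>]*>");
    private static final Pattern ANCHOR =
        Pattern.compile("(?is)<a\\b[^>]*>(.*?)</a\\s*>");
    private static final Pattern TAG = Pattern.compile("(?s)<[^>]+>");

    // Removes remaining inline tags and collapses whitespace.
    static String text(String fragment) {
        return TAG.matcher(fragment).replaceAll(" ")
                  .replaceAll("\\s+", " ").trim();
    }

    static int wordCount(String s) {
        return s.isEmpty() ? 0 : s.split(" ").length;
    }

    // Returns the text of blocks that look like main content.
    public static String extract(String html) {
        String cleaned = NON_CONTENT.matcher(html).replaceAll(" ");
        List<String> kept = new ArrayList<>();
        for (String block : BLOCK_BOUNDARY.split(cleaned)) {
            String blockText = text(block);
            int words = wordCount(blockText);
            if (words < MIN_WORDS) {
                continue; // short blocks: menus, captions, footers
            }
            int linkWords = 0;
            Matcher m = ANCHOR.matcher(block);
            while (m.find()) {
                linkWords += wordCount(text(m.group(1)));
            }
            // Blocks made mostly of link text are treated as navigation.
            if ((double) linkWords / words <= MAX_LINK_DENSITY) {
                kept.add(blockText);
            }
        }
        return String.join("\n\n", kept);
    }

    public static void main(String[] args) {
        String html = "<html><body>"
            + "<div><a href=\"/\">Home</a> <a href=\"/news\">News</a></div>"
            + "<p>The city council voted on Tuesday to approve a new budget "
            + "that funds road repairs and public libraries.</p>"
            + "<div>Copyright Example</div>"
            + "</body></html>";
        System.out.println(extract(html));
    }
}
```

The heuristic is deliberately crude: it drops short headings along with menus, so the thresholds would need tuning against the attached sample pages.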

!!! To be considered for the job, please outline the general direction you would take.

