Create a PHP function to crawl web site, find and save specific section of content

This project received 15 bids from talented freelancers with an average bid price of $164 USD.

Get free quotes for a project like this
Project Budget
$30 - $250 USD
Total Bids
Project Description

Create a PHP function (preferably using cURL) to crawl the SEC website to find the appropriate 10-k form (the annual report) for different companies and extract/save only Section 7, which is commonly referred to as the Management’s Discussion and Analysis. You are free to use my working PHP/cURL code in the attached pdf to get to the appropriate 10-k document, or start from scratch if you prefer. This project must use PHP.

A more detailed description is attached, along with the first few pages of what the sample output should look like. The attached spreadsheet contains a list of the companies this script will need to run agains, using the stock symbol and year as the input. In my initial research, I found that the 10-k's are reasonably structured but not always. I found that not all companies have a 10-k and some 10-k's you find won't have a Section 7, so please account for some error handling where appropriate.

I will ultimately be writing the text you find in Section 7 to a database, but you can also simply write the content to a txt file using the stock symbol as the file name (e.g., [url removed, login to view]).

Please only respond if you know PHP, have positive reviews and have worked on similar projects. Thank you!

Skills Required

Looking to make some money?

  • Set your budget and the timeframe
  • Outline your proposal
  • Get paid for your work

Hire Freelancers who also bid on this project

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online