Web Scraping with Python

In Progress Posted 7 years ago Paid on delivery
In Progress Paid on delivery

I need a Python Script to crawl a list of websites for security vulnerability and patch information for that given day. The sites will be sites like Cisco vulnerability database, juniper, ect. Once the data has been identified, I need to pull specific information about that sites listed vulnerabilities and paste them into an outlook email template. Instead of sending the emails. I need them to be saved in the Drafts section. After this is all completed the script should move to the next website in the “[login to view URL] website”. A breakdown of the functionality of the scrip is listed below.

Crawl the Site:

I need a Python Script to crawl a list of sites listed in a txt file.

1. Website file name is “[login to view URL]” -- Located in the same directory the script is run from.

2. Error Handling: An Error should state if this file does not exist.

I need to find vulnerabilities and patches. See the site for specific examples:

[login to view URL]

Scrap the Data:

This scraping tool should look for new vulnerabilities (on that site only – So, you don’t need to spider links associated with that website). So the criteria should be:

Date:

Vulnerability type: Critical, High, or Medium

Paste Data into Email Templates:

Paste the scraped information from the website into the designated email template. The template used will be determined from the site the data was scraped from. The name of the email template will be the root domain that is crawled. Using our example from above the template name should be “tools_cisco_com”

Date, Title of vulnerability, link, and vulnerability ID, Publication ID.

The templates will be changed over time. So, you can use tags to replace data. For example, one email template might have: “” The script would replace that with the title of the vulnerability. Each site will only have one email regardless of how many vulnerabilities are found on that site. Multiple vulnerabilities can (and should be) be listed in one email.

General Script Guidelines

The script should also be able to the following:

• I should be able to add additional sites to the [login to view URL] list without breaking the script.

• Graceful error handling. For example: If a website in the [login to view URL] does not have a corresponding email template. I should get an error explaining that.

Please comment your code well enough for other developers to edit your code in the future.

***** I need the code to be object-oriented. Each section should be placed in corresponding functions. ****** I can not accept the entire code block to be written in main. Thank you.

Python Web Scraping

Project ID: #11983303

About the project

15 proposals Remote project Active 7 years ago