Project Description:
Looking for Python scrapper to build automated web scrapping scripts. Should have sound knowledge of working on Linux based environment and prior experience in web scrapping.
Key skills required:
- "Grab" is expected to be used as the scraping module - http://pypi.python.org/pypi/grab/.
- PDFMiner - http://www.unixuser.org/~euske/python/pdfminer/index.html to be used for PDF to text extraction.
- xhtml2pdf.pisa for HTML to PDF conversion