-- task -- Partial working Code in thread description --
Skills needed:
- Python and Selenium
Good to Have:
- PowerBi
Objective:
- > Build webscrap extractor to download all files (.xls, .csv, .pdf) in a webpage
Refer to the thread description details with code working partially:
[login to view URL]
Quick description:
1. Download the .xls, .csv and .pdf from the tables "Diretorios" and "Arquivos", look for the <td <i tags.
- Each link on the "Diretorio" table opens other different files on the right tab aside "Arquivo"
2. Concatenate the files extracted into a folder or panda data frame (I guess)
*** Python code will be used inside PowerBi to retrieve all the documents extracted
Website Needed:
[login to view URL]