I need a script or web page, in PHP or another Linux scripting language, that retrieves data from a web page, parses it, and stores it in a database.
The page to fetch is passed to the script dynamically: the script is called with a parameter, which is then used to build the URL of the page. Example: [login to view URL]<parameter passed to the script>.
The resulting page needs to be parsed and the extracted data put into the database.
We estimate that around 10 values need to be extracted from each page. Performance is a priority.
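As a rough illustration of how the parameter could be turned into the page address, here is a minimal Python sketch. The prefix `BASE_URL` and the function name `build_page_url` are hypothetical, since the real URL is not shown in this brief; the only idea taken from the text is that the parameter is appended to a fixed prefix.

```python
from urllib.parse import quote

# Hypothetical prefix: the real URL is hidden in the brief.
BASE_URL = "https://example.invalid/product/"


def build_page_url(parameter: str, base_url: str = BASE_URL) -> str:
    """Append the percent-encoded parameter to the URL prefix."""
    # safe="" also encodes "/" so the parameter cannot alter the path.
    return base_url + quote(parameter, safe="")


# Example call with a made-up parameter value:
# build_page_url("AB 12/3")
```

Encoding the parameter before appending it keeps unusual characters in a product code from producing a malformed request.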
## Deliverables
The data that we need to parse is on this page:
[login to view URL]
The 'codice' part changes for each product. We need to be **LOGGED IN** to the page to get the correct price and description of each item.
The parsing also needs to be very fast.
The data should then be inserted into a local MySQL database, following insertion rules.
The script will get the list of 'codice' values to process from a MySQL table, which also indicates which fields must be updated (this could be just one or two parameters).
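One possible way to stay logged in across requests is a cookie-aware HTTP client. The sketch below uses only the Python standard library. The form field names `username` and `password`, and the assumption that login is a plain form POST, are guesses; the real login form may differ.

```python
import http.cookiejar
import urllib.request
from urllib.parse import urlencode


def make_session():
    """Return an opener that stores and resends cookies, plus its cookie jar."""
    jar = http.cookiejar.CookieJar()
    opener = urllib.request.build_opener(urllib.request.HTTPCookieProcessor(jar))
    return opener, jar


def make_login_request(login_url: str, username: str, password: str):
    """Build a form POST; the field names are assumptions, not from the site."""
    body = urlencode({"username": username, "password": password}).encode("ascii")
    # Supplying a request body makes urllib send this as a POST.
    return urllib.request.Request(login_url, data=body)


# Intended use (not executed here, it needs network access):
#   opener, jar = make_session()
#   opener.open(make_login_request(login_url, user, pwd))
#   html = opener.open(product_url).read()
```

Reusing one opener for the login and for every product page means the session cookie is sent automatically, so the login only has to happen once per run.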
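The "update only what the table says" rule could be handled by generating the SQL per job. The following Python sketch builds a parameterized MySQL `INSERT ... ON DUPLICATE KEY UPDATE` statement for a chosen set of columns. The table name, the column names, and the allow-list `ALLOWED_COLUMNS` are hypothetical, and it assumes 'codice' is a unique key in the target table.

```python
# Hypothetical column names; the real schema is not described in the brief.
ALLOWED_COLUMNS = {"price", "availability", "description"}


def build_upsert(table: str, key_column: str, columns: list) -> str:
    """Build an upsert that only touches the requested columns.

    `table` and `key_column` are assumed to be trusted constants;
    `columns` is checked against an allow-list because it comes from data.
    """
    if not columns:
        raise ValueError("at least one column to update is required")
    for name in columns:
        if name not in ALLOWED_COLUMNS:
            raise ValueError(f"unexpected column: {name}")
    all_columns = [key_column, *columns]
    placeholders = ", ".join(["%s"] * len(all_columns))
    updates = ", ".join(f"{c} = VALUES({c})" for c in columns)
    return (
        f"INSERT INTO {table} ({', '.join(all_columns)}) "
        f"VALUES ({placeholders}) "
        f"ON DUPLICATE KEY UPDATE {updates}"
    )
```

The `%s` placeholders match the parameter style used by the common Python MySQL drivers, so the values themselves are passed separately at execution time rather than being pasted into the statement.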
[login to view URL]
I have attached, as separate files, a screenshot of the page and the values that we need to get from it:
1. Categories - they need to be split in the database; the splitting character is '>'.
2. Name of the product (codice)
3. Another codice
4. Availability
5. Parameters - these should be saved in the form specification -> value. The specification is the text in bold; the value is the text that follows it.
6. Brand image. The image should be saved into a specific directory, and the file name should be saved in the database. If the file already exists, the download should be skipped, but the file name must be saved anyway.
7. Price
8. Description fields: each of those pages must be saved as full HTML, one table field per page. The pages are descrizione, scheda tecnica, etc.
9. Image of the product, handled the same way as the brand image.
Each product should therefore be a single record containing all of the above.
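To make items 1 and 5 above concrete, here is a minimal Python sketch using only the standard library. The markup shape (`<b>label</b>` followed by the value text) is an assumption, since the real page is only available as a screenshot; the names `split_categories`, `SpecParser` and `parse_specs` are hypothetical.

```python
from html.parser import HTMLParser


def split_categories(breadcrumb: str) -> list:
    """Split 'A > B > C' on '>' and drop surrounding whitespace."""
    return [part.strip() for part in breadcrumb.split(">") if part.strip()]


class SpecParser(HTMLParser):
    """Collect bold text as a label and the text after it as its value."""

    def __init__(self):
        super().__init__()
        self.raw = {}          # label -> list of text chunks
        self._in_bold = False
        self._label_parts = []
        self._current = None   # label currently receiving value text

    def handle_starttag(self, tag, attrs):
        if tag in ("b", "strong"):
            self._in_bold = True
            self._label_parts = []

    def handle_endtag(self, tag):
        if tag in ("b", "strong") and self._in_bold:
            self._in_bold = False
            label = "".join(self._label_parts).strip().rstrip(":").strip()
            self._current = label or None
            if label:
                self.raw[label] = []

    def handle_data(self, data):
        if self._in_bold:
            self._label_parts.append(data)
        elif self._current is not None:
            self.raw[self._current].append(data)


def parse_specs(html: str) -> dict:
    """Return {specification: value} for a fragment of the parameters block."""
    parser = SpecParser()
    parser.feed(html)
    parser.close()  # flush any buffered trailing text
    return {label: "".join(chunks).strip() for label, chunks in parser.raw.items()}
```

A streaming parser like this avoids building a full document tree, which may help with the speed requirement, although a real implementation would need to be checked against the actual page markup.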
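The image rule from items 6 and 9 (skip the download if the file is already there, but always record the file name) might look like the Python sketch below. The function name `save_image` and the choice to derive the file name from the last segment of the URL path are assumptions; the `fetch` argument exists so the download step can be swapped out.

```python
import os
from urllib.parse import urlparse
from urllib.request import urlopen


def _download(url: str) -> bytes:
    """Default fetcher: read the raw bytes of the image."""
    with urlopen(url) as response:
        return response.read()


def save_image(image_url: str, directory: str, fetch=_download) -> str:
    """Save the image unless it already exists; always return the file name."""
    # Assumption: the file name is the last segment of the URL path.
    filename = os.path.basename(urlparse(image_url).path)
    if not filename:
        raise ValueError(f"no file name in URL: {image_url}")
    os.makedirs(directory, exist_ok=True)
    target = os.path.join(directory, filename)
    if not os.path.exists(target):
        data = fetch(image_url)
        with open(target, "wb") as handle:
            handle.write(data)
    # The name is returned either way so it can be stored in the record.
    return filename
```

Because many products are likely to share the same brand image, skipping existing files should also save a noticeable number of requests.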
Please place your best bids. This is a STARTER job: we will add more pages to parse in the future and more bid requests will follow, with priority given to the people we have already worked with.