I am interested in scrapping data from: [url removed, login to view] or directly [url removed, login to view]
( this is the Romanina location pf public Jurisprudence)
WHAT HA S TO BE DONE
1. Make a basic search using the key letter "a" like in the next picture: ( push the button "Cauta" - meaning "search")
( this is in order to have almost all the items listed in the results list). ( Fig 1 is very intuitive)
2. You will obtain the search results: 227513. These search result list is the target.
Push the "Data" link in order to have the results ordered by date.
( see Fig 2 )
[url removed, login to view] entry is an item that we should scrap. ( fig 3)
[url removed, login to view] for what to scrap in separate fields you have in the next picture (fig 4)
The separate fields are:
a. Name: Hotararea
b. Number ( where is possible just the number - we should eliminate the rest of the information)
c. Date ( 2009-09-21)
d. Field CA - in this example - there are few alternatives ( T, J etc) separate field
e. Field IASI ( in this example) - separate field
f. Field 'Contencios administrativ si fiscal' in this example - separate field
g. Field 'conflict de competenta' in this example - separate field
h. Field 'Fond' in this example - separate field
After scrapping this basic data please press the number of the ITEM ( 18/CA) in fig.4, and a word document is to be open.
The text in this document has to be scrapped as an individual.
The layout of the text has to be preserved. Is not usefull if the text is parsed in a long listing word array.
The parsing instrument has to be available in the future if the site will upload with new items in the future.
Always by making the selection after Data the last introduced items willl be at the beggining.
17 freelancers are bidding on average $11/hour for this job
Hi, i've pretty much experience in such a projects, including parsing Office files. Only question is how do you want the output data saved? I can start right now. Will take about 5-10 hours.