The DMOZ database can be downloaded from http://rdf.dmoz.org. It is distributed as RDF/XML and is very large: currently over 1.8 GB uncompressed (260 MB as a zipped file distributed by [url removed, login to view]). The file contains over 590,000 categories and 4,530,823 web links.
I want someone to extract all of the URLs, category by category, into text files. The extracted URLs should be output in the form [url removed, login to view], not as inner-page links like [url removed, login to view].
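
To make the requirement concrete, here is a rough Python sketch of one possible approach. It is not a finished solution. It assumes that each link in the dump is an `ExternalPage` element with an `about` attribute holding the URL and a child `topic` element holding the category path; that layout should be checked against the actual file. It also assumes that "not the inner pages" means reducing each URL to its scheme and host. The function names and the one-file-per-category naming scheme are made up for illustration.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict
from pathlib import Path
from urllib.parse import urlsplit


def local(name):
    """Strip an XML namespace such as '{uri}tag' down to 'tag'."""
    return name.rsplit("}", 1)[-1]


def site_root(url):
    """Reduce a URL to 'scheme://host/'; return None if it has no scheme or host."""
    parts = urlsplit(url)
    if not parts.scheme or not parts.netloc:
        return None
    return f"{parts.scheme}://{parts.netloc}/"


def roots_by_category(source):
    """Stream the dump and collect the set of site roots for each category.

    Assumed layout: <ExternalPage about="URL"><topic>Category/Path</topic></ExternalPage>
    """
    result = defaultdict(set)
    context = ET.iterparse(source, events=("start", "end"))
    _, root = next(context)  # the first event is the start of the root element
    for event, elem in context:
        if event != "end" or local(elem.tag) != "ExternalPage":
            continue
        url = next((v for k, v in elem.attrib.items() if local(k) == "about"), None)
        topic = next((c.text for c in elem if local(c.tag) == "topic"), None)
        site = site_root(url) if url else None
        if site and topic:
            result[topic.strip()].add(site)
        root.clear()  # drop already-processed elements so memory stays bounded
    return result


def write_category_files(result, out_dir):
    """Write one text file per category, one site root per line (hypothetical naming)."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    for topic, roots in result.items():
        name = topic.replace("/", "_") + ".txt"
        (out / name).write_text("\n".join(sorted(roots)) + "\n", encoding="utf-8")
```

The sketch streams the file with `iterparse` rather than loading it whole, since the dump is over 1.8 GB. Duplicate site roots within a category are collapsed by the set. Category names containing characters that are not valid in file names would need extra handling.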