I need the most recent alexa 1m database in csv and xml format with the following criteria:
- Alexa rank
- Url shortened to just "[url removed, login to view]" ex [url removed, login to view]
- category tree separated into individual columns for each sub category
- Meta title
- Meta description
- Foreign sites removed, so if the sitenames, titles, or description are not in english then the whole row should be removed.
If the categories cannot be pulled from alexa, then they should be pulled from dmoz and still fulfill the above requirements.