I need a PERL program to parse a 1 GigaByte XML file into several smaller XML files. The data file can be found at [login to view URL] ([login to view URL]) This file contains categorical information and will need to be extracted at the top most category levels. It should make about 16 separate XML files. FOR EXAMPLE: If you go to [login to view URL] you will see the categories listed on the home page. For example, "Adult", "Arts", "Business", "Computers", etc. These are known as the Top-Level Categories. The data file contains all these categories along with a tag called which are simply links to web sites under each respective category. There are sub-categories under each Top-Level category but they will simply become children within the Top-Level category XML file. They do not need to be parsed. Just written to the file under the Top-Level Category. The program must iterate the data file, see when a new Top-Level Category has started (like "Top/Adult", "Top/Arts", "Top/Business", etc.). When a Top-Level category does start the program will start writing to a file that is named the same as the Top-Level Category (like "CategoryName".xml ==> "[login to view URL]", "[login to view URL]", etc...) The program will then write out line for line each node under this category. The resulting XML files will like like a smaller version of the complete data fiile with respect to starting tags, nodes and ending tags. So the result will be: "[login to view URL]": tag tags "[login to view URL]": tag tags And so on....
## Deliverables
Complete and fully-functional working program(s) in executable form as well as complete source code of all work done. Complete copyrights to all work purchased.
## Platform
Windows 2000