The system of payment orders is needed to be developed. Huge amount of payment orders are provided. The requierments are:
1. A set of payment orders must be clustered into a number of clusers according to point #2.
2. Those clusters must reflect the field of a company. E.g. the Comany #1 is a financial company, the Company #2 is row materials, the Company #3 is row material company, the Company #4 is building company and so forth.
3. A number of clusters must be determined automaticaly.
4. Statistical significance of number of clusters must be provided.
5. The system must be robust, i.e. it processes correctly small, medium, and large amount of date Small, medium, and large means less 1 000 payment orders, from 1 000 to 30 000 payment orders, greater then 30 000 payment orders in a data set respectively.
6. The language of payment orders is English.
7. The time of 50 000 payment orders processing should be about an hour.
8. You are free to choose a method or methods.
9. Programming language is Python.