data categorization in cloud computing... in this we have three diff clouds(for bank, offices, hospitals) in this project we have analyse the file and then decide whether it is bank's or hospital etc. file...then send that file into respective cloud... for this we train our system by keywords for each file type and implement using decision tree algorithm. You can use bi-gram model for string comparison.
Implement any algorithm of decision tree which are used in data mining containing nodes and leaves. You can use any decision tree algorithm (i.e. cart ,ID3-4-5, C4.5 or any other from supervised algorithm) which has higher accuracy rate only use that algorithm which has high accuracy among them. This decision tree can be able to find the accuracy of decision tree that what is the percentage of accuracy and complexity computation of tree using confusion matrix and big O’s method and calculate time. Also provide document explaining which and why you use specific algorithm and formulas to find accuracy and complexity And how many test cases and training datasets are used.
6 freelancers are bidding on average $215 for this job
Hi, I have more that 4+years of experience in hadoop and data mining technologies like HDFS, MapReduce, python, Spark, Hive etc. Please review my profile for skills Contact me for more detail