随机森林
模式识别(心理学)
分类器(UML)
计算机科学
人工智能
水准点(测量)
统计分类
数据挖掘
大地测量学
地理
作者
Angshuman Paul,Dipti Prasad Mukherjee,Prasun Das,Abhinandan Gangopadhyay,Appa Rao Chintha,Saurabh Kundu
标识
DOI:10.1109/tip.2018.2834830
摘要
We propose an improved random forest classifier that performs classification with minimum number of trees. The proposed method iteratively removes some unimportant features. Based on the number of important and unimportant features, we formulate a novel theoretical upper limit on the number of trees to be added to the forest to ensure improvement in classification accuracy. Our algorithm converges with a reduced but important set of features. We prove that further addition of trees or further reduction of features does not improve classification performance. The efficacy of the proposed approach is demonstrated through experiments on benchmark datasets. We further use the proposed classifier to detect mitotic nuclei in the histopathological datasets of breast tissues. We also apply our method on the industrial dataset of dual phase steel microstructures to classify different phases. Results of our method on different datasets show significant reduction in average classification error compared to a number of competing methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI