提交
随机森林
决策树
计算机科学
分类器(UML)
独创性
利润(经济学)
集合(抽象数据类型)
人工智能
经济短缺
财务报表
机器学习
语句(逻辑)
水准点(测量)
数据挖掘
业务
经济
会计
心理学
数据库
地理
哲学
语言学
政府(语言学)
微观经济学
创造力
政治学
大地测量学
法学
社会心理学
审计
程序设计语言
作者
Byungdae An,Yongmoo Suh
标识
DOI:10.1108/dta-11-2019-0208
摘要
Purpose Financial statement fraud (FSF) committed by companies implies the current status of the companies may not be healthy. As such, it is important to detect FSF, since such companies tend to conceal bad information, which causes a great loss to various stakeholders. Thus, the objective of the paper is to propose a novel approach to building a classification model to identify FSF, which shows high classification performance and from which human-readable rules are extracted to explain why a company is likely to commit FSF. Design/methodology/approach Having prepared multiple sub-datasets to cope with class imbalance problem, we build a set of decision trees for each sub-dataset; select a subset of the set as a model for the sub-dataset by removing the tree, each of whose performance is less than the average accuracy of all trees in the set; and then select one such model which shows the best accuracy among the models. We call the resulting model MRF (Modified Random Forest). Given a new instance, we extract rules from the MRF model to explain whether the company corresponding to the new instance is likely to commit FSF or not. Findings Experimental results show that MRF classifier outperformed the benchmark models. The results also revealed that all the variables related to profit belong to the set of the most important indicators to FSF and that two new variables related to gross profit which were unapprised in previous studies on FSF were identified. Originality/value This study proposed a method of building a classification model which shows the outstanding performance and provides decision rules that can be used to explain the classification results. In addition, a new way to resolve the class imbalance problem was suggested in this paper.
科研通智能强力驱动
Strongly Powered by AbleSci AI