A Review of Tree‐Based Methods for Analyzing Survey Data

计算机科学 数据挖掘 树(集合论) 数据科学 计量经济学 统计 数学 数学分析
作者
Diya Bhaduri,Daniell Toth,Scott H. Holan
出处
期刊:Wiley Interdisciplinary Reviews: Computational Statistics [Wiley]
卷期号:17 (1)
标识
DOI:10.1002/wics.70010
摘要

ABSTRACT Recent advances in data complexity and availability present both challenges and opportunities for automated data exploration. Tree‐based methods, known for their interpretability, are widely used for building regression and classification models. However, they often lag behind the best supervised learning approaches in terms of prediction accuracy. To address this limitation, ensemble methods, such as random forests, combine multiple trees to improve prediction accuracy, though at the cost of interpretability. While tree‐based methods have seen extensive use in various fields, their application in the context of complex survey data has been relatively limited. This article provides an overview of the state‐of‐the‐art tree‐based approaches for analyzing complex survey data. It distinguishes methods explicitly designed for survey contexts from those adapted from other domains. The discussion covers applications in model‐assisted approaches, disclosure limitation, and small area estimation, as well as other recent methodological developments tailored to survey data. Additionally, the article explores aggregated tree models that sacrifice interpretability for improved prediction accuracy. These models, such as Bagging, Random Forests, and Boosting, are explained, along with the concept of out‐of‐bag error for model evaluation. Finally, this article provides the history and development of tree models, from their origins in regression trees to more recent Bayesian approaches, and aggregated tree models. This overview sheds light on the potential utility of tree‐based methods in survey methodology and provides insights into future research directions in this evolving field.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
liya完成签到,获得积分10
刚刚
djbj2022发布了新的文献求助10
1秒前
wxy发布了新的文献求助10
1秒前
LLN完成签到,获得积分10
2秒前
2秒前
酸奶巧克力完成签到,获得积分10
4秒前
FashionBoy应助Stuck1n采纳,获得30
5秒前
积极剑封发布了新的文献求助10
6秒前
风之子完成签到,获得积分10
6秒前
wxy完成签到,获得积分10
6秒前
小番茄完成签到,获得积分10
10秒前
科研通AI6.2应助星线采纳,获得10
12秒前
hh完成签到,获得积分10
13秒前
丘比特应助尼古拉斯采纳,获得10
18秒前
务实的如冬完成签到 ,获得积分10
19秒前
Jiaaa完成签到 ,获得积分10
19秒前
Greg应助内向悲采纳,获得10
19秒前
WHY完成签到,获得积分10
20秒前
22秒前
小野菌完成签到,获得积分10
22秒前
怕孤单的开山完成签到 ,获得积分10
23秒前
Orange应助浮浮世世采纳,获得10
25秒前
医痞子完成签到,获得积分10
27秒前
大豆发布了新的文献求助10
27秒前
小王小王完成签到,获得积分10
30秒前
36秒前
WHITE1完成签到,获得积分10
36秒前
栾松壕完成签到,获得积分10
36秒前
超帅从彤完成签到,获得积分10
39秒前
大豆发布了新的文献求助10
41秒前
XIAOLAN完成签到,获得积分10
45秒前
上官若男应助ymx采纳,获得10
47秒前
纯真的半山完成签到,获得积分10
47秒前
49秒前
50秒前
学术文献互助应助XIAOLAN采纳,获得150
50秒前
大威天龙完成签到,获得积分10
51秒前
三块钱土豆完成签到 ,获得积分10
51秒前
54秒前
大豆发布了新的文献求助10
56秒前
高分求助中
液晶指向矢仿真分析数据集 8888
Invited Discussant 63O and 64O 1000
Dr. Dirk Wiechmann on Lingual Orthodontics: Part I 888
Ideology and Meaning-Making under the Putin Regime 750
化工技术经济第五版电子版 500
Petrology and Plate Tectonics 500
Writing Systems 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 计算机科学 化学工程 生物化学 物理 内科学 复合材料 催化作用 光电子学 物理化学 电极 细胞生物学 基因 遗传学
热门帖子
关注 科研通微信公众号,转发送积分 6880529
求助须知:如何正确求助?哪些是违规求助? 8580181
关于积分的说明 18229959
捐赠科研通 6263549
什么是DOI,文献DOI怎么找? 3055054
关于科研通互助平台的介绍 2065338
邀请新用户注册赠送积分活动 2032715