清晨好,您是今天最早来到科研通的研友!由于当前在线用户较少,发布求助请尽量完整的填写文献信息,科研通机器人24小时在线,伴您科研之路漫漫前行!

Feature Selection and Feature Stability Measurement Method for High-Dimensional Small Sample Data Based on Big Data Technology

特征选择 计算机科学 数据挖掘 最小冗余特征选择 聚类分析 特征(语言学) 人工智能 模式识别(心理学) 大数据 理论(学习稳定性) 选择(遗传算法) 样品(材料) 冗余(工程) 机器学习 操作系统 哲学 色谱法 化学 语言学
作者
Chenguang Huang
出处
期刊:Computational Intelligence and Neuroscience [Hindawi Limited]
卷期号:2021: 1-12 被引量:6
标识
DOI:10.1155/2021/3597051
摘要

With the rapid development of artificial intelligence in recent years, the research on image processing, text mining, and genome informatics has gradually deepened, and the mining of large-scale databases has begun to receive more and more attention. The objects of data mining have also become more complex, and the data dimensions of mining objects have become higher and higher. Compared with the ultra-high data dimensions, the number of samples available for analysis is too small, resulting in the production of high-dimensional small sample data. High-dimensional small sample data will bring serious dimensional disasters to the mining process. Through feature selection, redundancy and noise features in high-dimensional small sample data can be effectively eliminated, avoiding dimensional disasters and improving the actual efficiency of mining algorithms. However, the existing feature selection methods emphasize the classification or clustering performance of the feature selection results and ignore the stability of the feature selection results, which will lead to unstable feature selection results, and it is difficult to obtain real and understandable features. Based on the traditional feature selection method, this paper proposes an ensemble feature selection method, Random Bits Forest Recursive Clustering Eliminate (RBF-RCE) feature selection method, combined with multiple sets of basic classifiers to carry out parallel learning and screen out the best feature classification results, optimizes the classification performance of traditional feature selection methods, and can also improve the stability of feature selection. Then, this paper analyzes the reasons for the instability of feature selection and introduces a feature selection stability measurement method, the Intersection Measurement (IM), to evaluate whether the feature selection process is stable. The effectiveness of the proposed method is verified by experiments on several groups of high-dimensional small sample data sets.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
看看文章完成签到 ,获得积分10
18秒前
精壮小伙完成签到,获得积分0
29秒前
秋夜临完成签到,获得积分10
31秒前
契合发布了新的文献求助30
40秒前
木可完成签到,获得积分10
1分钟前
福尔摩曦完成签到,获得积分10
1分钟前
X519664508完成签到,获得积分0
1分钟前
小红书求接接接接一篇完成签到,获得积分10
1分钟前
hmhu完成签到,获得积分10
1分钟前
沉沉完成签到 ,获得积分0
1分钟前
hmhu发布了新的文献求助10
1分钟前
贝贝完成签到,获得积分0
2分钟前
2分钟前
Tong完成签到,获得积分0
2分钟前
shikaly完成签到,获得积分0
2分钟前
妄语发布了新的文献求助10
2分钟前
Wang发布了新的文献求助10
2分钟前
Wang完成签到,获得积分10
2分钟前
墨水完成签到 ,获得积分10
2分钟前
研友_shuang完成签到,获得积分0
2分钟前
3分钟前
Dream完成签到,获得积分10
3分钟前
白白嫩嫩完成签到,获得积分10
3分钟前
巫巫巫巫巫完成签到 ,获得积分10
4分钟前
ssl完成签到 ,获得积分10
4分钟前
4分钟前
艾比西地完成签到 ,获得积分10
4分钟前
WILSON发布了新的文献求助10
4分钟前
堇笙vv完成签到,获得积分0
4分钟前
jlwang完成签到,获得积分10
4分钟前
三人水明完成签到 ,获得积分10
4分钟前
Patrick完成签到 ,获得积分10
5分钟前
CC完成签到,获得积分0
5分钟前
guantlv完成签到,获得积分10
5分钟前
cai白白完成签到,获得积分0
5分钟前
6分钟前
舒服的幼荷完成签到,获得积分10
6分钟前
炎炎夏无声完成签到 ,获得积分10
6分钟前
袁国锋完成签到 ,获得积分10
6分钟前
khaosyi完成签到 ,获得积分10
6分钟前
高分求助中
A pan-cancer cuproptosis signature predicting immunotherapy response and prognosis 1500
Straight Talk about ADHD in Girls: How to Help Your Daughter Thrive 1100
Lorenz Luthi - The Regional Cold Wars in Europe, East Asia, and the Middle East Crucial Periods and Turning Points 1000
Models of Teaching(The 10th Edition,第10版!)《教学模式》(第10版!) 800
Full waveform acoustic data processing 500
More Activities for Teaching Positive Psychology A Guide for Instructors 330
The Chicago Manual of Style, 18th Edition 300
热门求助领域 (近24小时)
化学 医学 材料科学 生物 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 基因 遗传学 物理化学 催化作用 免疫学 细胞生物学 电极
热门帖子
关注 科研通微信公众号,转发送积分 2887826
求助须知:如何正确求助?哪些是违规求助? 2507827
关于积分的说明 6789567
捐赠科研通 2183623
什么是DOI,文献DOI怎么找? 1160831
版权声明 586630
科研通“疑难数据库(出版商)”最低求助积分说明 569371