粗集
计算机科学
特征选择
降维
人工智能
语义学(计算机科学)
数据挖掘
模糊集
特征(语言学)
模式识别(心理学)
维数之咒
模糊逻辑
还原(数学)
机器学习
数学
哲学
语言学
程序设计语言
几何学
作者
Richard Jensen,Qiang Shen
摘要
Semantics-preserving dimensionality reduction refers to the problem of selecting those input features that are most predictive of a given outcome; a problem encountered in many areas such as machine learning, pattern recognition, and signal processing. This has found successful application in tasks that involve data sets containing huge numbers of features (in the order of tens of thousands), which would be impossible to process further. Recent examples include text processing and Web content classification. One of the many successful applications of rough set theory has been to this feature selection area. This paper reviews those techniques that preserve the underlying semantics of the data, using crisp and fuzzy rough set-based methodologies. Several approaches to feature selection based on rough set theory are experimentally compared. Additionally, a new area in feature selection, feature grouping, is highlighted and a rough set-based feature grouping technique is detailed.
科研通智能强力驱动
Strongly Powered by AbleSci AI