网页
计算机科学
人工智能
相关性
特征(语言学)
班级(哲学)
监督学习
半监督学习
数据挖掘
机器学习
模式识别(心理学)
判别式
线性判别分析
数学
人工神经网络
语言学
几何学
万维网
哲学
作者
Xiao‐Yuan Jing,Fei Wu,Xiwei Dong,Shiguang Shan,Songcan Chen
出处
期刊:Proceedings of the ... AAAI Conference on Artificial Intelligence
[Association for the Advancement of Artificial Intelligence (AAAI)]
日期:2017-02-12
卷期号:31 (1)
被引量:31
标识
DOI:10.1609/aaai.v31i1.10741
摘要
Webpage classification has attracted a lot of research interest. Webpage data is often multi-view and high-dimensional, and the webpage classification application is usually semi-supervised. Due to these characteristics, using semi-supervised multi-view feature learning (SMFL) technique to deal with the webpage classification problem has recently received much attention. However, there still exists room for improvement for this kind of feature learning technique. How to effectively utilize the correlation information among multi-view of webpage data is an important research topic. Correlation analysis on multi-view data can facilitate extraction of the complementary information. In this paper, we propose a novel SMFL approach, named semi-supervised multi-view correlation feature learning (SMCFL), for webpage classification. SMCFL seeks for a discriminant common space by learning a multi-view shared transformation in a semi-supervised manner. In the discriminant space, the correlation between intra-class samples is maximized, and the correlation between inter-class samples and the global correlation among both labeled and unlabeled samples are minimized simultaneously. We transform the matrix-variable based nonconvex objective function of SMCFL into a convex quadratic programming problem with one real variable, and can achieve a global optimal solution. Experiments on widely used datasets demonstrate the effectiveness and efficiency of the proposed approach.
科研通智能强力驱动
Strongly Powered by AbleSci AI