计算机科学
人工智能
生成语法
特征学习
代表(政治)
生成模型
深度学习
自然语言
图形
实证研究
自然语言处理
自然语言理解
机器学习
语言理解
外部数据表示
数据科学
开放式研究
监督学习
自然(考古学)
中间语言
脆弱性(计算)
语言习得
半监督学习
标记数据
作者
Xiao Liu,Fanjin Zhang,Zhenyu Hou,Li Mian,Zhaoyu Wang,Jing Zhang,Jie Tang
标识
DOI:10.1109/tkde.2021.3090866
摘要
Deep supervised learning has achieved great success in the last decade. However, its defects of heavy dependence on manual labels and vulnerability to attacks have driven people to find other paradigms. As an alternative, self-supervised learning (SSL) attracts many researchers for its soaring performance on representation learning in the last several years. Self-supervised representation learning leverages input data itself as supervision and benefits almost all types of downstream tasks. In this survey, we take a look into new self-supervised learning methods for representation in computer vision, natural language processing, and graph learning. We comprehensively review the existing empirical methods and summarize them into three main categories according to their objectives: generative, contrastive, and generative-contrastive (adversarial). We further collect related theoretical analyses on self-supervised learning to provide deeper thoughts on why self-supervised learning works. Finally, we briefly discuss open problems and future directions for self-supervised learning. An outline slide for the survey is provided.
科研通智能强力驱动
Strongly Powered by AbleSci AI