计算机科学
人工智能
深度学习
人工神经网络
水准点(测量)
半监督学习
机器学习
管道(软件)
模式识别(心理学)
特征(语言学)
领域(数学)
特征学习
视觉学习
无监督学习
监督学习
语言学
发展心理学
哲学
数学
心理学
大地测量学
程序设计语言
纯数学
地理
作者
Longlong Jing,Yingli Tian
出处
期刊:Cornell University - arXiv
日期:2019-01-01
被引量:92
标识
DOI:10.48550/arxiv.1902.06162
摘要
Large-scale labeled data are generally required to train deep neural networks in order to obtain better performance in visual feature learning from images or videos for computer vision applications. To avoid extensive cost of collecting and annotating large-scale datasets, as a subset of unsupervised learning methods, self-supervised learning methods are proposed to learn general image and video features from large-scale unlabeled data without using any human-annotated labels. This paper provides an extensive review of deep learning-based self-supervised general visual feature learning methods from images or videos. First, the motivation, general pipeline, and terminologies of this field are described. Then the common deep neural network architectures that used for self-supervised learning are summarized. Next, the main components and evaluation metrics of self-supervised learning methods are reviewed followed by the commonly used image and video datasets and the existing self-supervised visual feature learning methods. Finally, quantitative performance comparisons of the reviewed methods on benchmark datasets are summarized and discussed for both image and video feature learning. At last, this paper is concluded and lists a set of promising future directions for self-supervised visual feature learning.
科研通智能强力驱动
Strongly Powered by AbleSci AI