DeViSE: A Deep Visual-Semantic Embedding Model

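Computer science; Leverage (statistics); Embedding; Artificial intelligence; Class (philosophy); Object (grammar); Visualization; Cognitive neuroscience of visual object recognition; Natural language processing; Training set; Image (mathematics); Deep learning; Information retrieval; Pattern recognition (psychology)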
Authors
Andrea Frome, Greg S. Corrado, Jon Shlens, Samy Bengio, Jeff Dean, Marc’Aurelio Ranzato, Tomáš Mikolov
Source
Journal: Neural Information Processing Systems, Volume 26, pp. 2121-2129, Citations: 2213
Abstract

Modern visual recognition systems are often limited in their ability to scale to large numbers of object categories. This limitation is in part due to the increasing difficulty of acquiring sufficient training data in the form of labeled images as the number of object categories grows. One remedy is to leverage data from other sources, such as text data, both to train visual models and to constrain their predictions. In this paper we present a new deep visual-semantic embedding model trained to identify visual objects using both labeled image data and semantic information gleaned from unannotated text. We demonstrate that this model matches state-of-the-art performance on the 1000-class ImageNet object recognition challenge while making more semantically reasonable errors, and also show that the semantic information can be exploited to make predictions about tens of thousands of image labels not observed during training. Semantic knowledge improves such zero-shot predictions, achieving hit rates of up to 18% across thousands of novel labels never seen by the visual model.
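
The abstract does not spell out how a label that was never seen during visual training can be predicted. One common way to do this in a shared visual-semantic embedding space is nearest-neighbor lookup: project the image's visual features into the space of text-derived label vectors, then rank all labels, seen or unseen, by similarity. The sketch below illustrates that idea only and is not the authors' implementation. The label vectors, the projection matrix `M`, the feature values, and all function names are hypothetical toy values chosen for illustration.

```python
import math


def normalize(v):
    """Scale a vector to unit length (a zero vector is returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def project(matrix, features):
    """Apply a linear map (given as a list of rows) to a visual feature vector."""
    return [sum(w * x for w, x in zip(row, features)) for row in matrix]


def rank_labels(image_vec, label_vecs):
    """Return labels sorted by cosine similarity to the projected image vector."""
    q = normalize(image_vec)
    scores = {
        label: sum(a * b for a, b in zip(q, normalize(vec)))
        for label, vec in label_vecs.items()
    }
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical 3-d label embeddings, standing in for vectors learned from text.
LABEL_VECS = {
    "cat": [0.9, 0.1, 0.0],
    "tiger": [0.8, 0.3, 0.1],  # assumed to have no labeled training images
    "truck": [0.0, 0.2, 0.9],
}

# Hypothetical projection from 2-d visual features into the 3-d label space.
M = [[1.0, 0.0], [0.0, 0.5], [0.0, 1.0]]

ranking = rank_labels(project(M, [1.0, 0.5]), LABEL_VECS)
print(ranking)
```

Because ranking uses only the label vectors, "tiger" can be returned even though, in this toy setup, no tiger image was assumed to be available for training; the text-derived geometry places it near "cat".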
