水准点(测量)
计算机科学
人工智能
嵌入
对象(语法)
公制(单位)
模式识别(心理学)
目标检测
机器学习
任务(项目管理)
数据挖掘
大地测量学
运营管理
经济
管理
地理
作者
Leonid Karlinsky,Joseph Shtok,Sivan Harary,Eli Schwartz,Amit Aides,Rogério Feris,Raja Giryes,Alex Bronstein
出处
期刊:Cornell University - arXiv
日期:2018-01-01
被引量:36
标识
DOI:10.48550/arxiv.1806.04728
摘要
Distance metric learning (DML) has been successfully applied to object classification, both in the standard regime of rich training data and in the few-shot scenario, where each category is represented by only a few examples. In this work, we propose a new method for DML that simultaneously learns the backbone network parameters, the embedding space, and the multi-modal distribution of each of the training categories in that space, in a single end-to-end training process. Our approach outperforms state-of-the-art methods for DML-based object classification on a variety of standard fine-grained datasets. Furthermore, we demonstrate the effectiveness of our approach on the problem of few-shot object detection, by incorporating the proposed DML architecture as a classification head into a standard object detection model. We achieve the best results on the ImageNet-LOC dataset compared to strong baselines, when only a few training examples are available. We also offer the community a new episodic benchmark based on the ImageNet dataset for the few-shot object detection task.
科研通智能强力驱动
Strongly Powered by AbleSci AI