计算机科学
人工智能
代表(政治)
生成模型
图像(数学)
语义学(计算机科学)
参数化复杂度
外部数据表示
生成语法
钥匙(锁)
机器学习
模式识别(心理学)
算法
政治
计算机安全
程序设计语言
法学
政治学
作者
Brandon Trabucco,Kyle G. Doherty,Max Gurinas,Ruslan Salakhutdinov
出处
期刊:Cornell University - arXiv
日期:2023-02-07
被引量:81
标识
DOI:10.48550/arxiv.2302.07944
摘要
Data augmentation is one of the most prevalent tools in deep learning, underpinning many recent advances, including those from classification, generative models, and representation learning. The standard approach to data augmentation combines simple transformations like rotations and flips to generate new images from existing ones. However, these new images lack diversity along key semantic axes present in the data. Current augmentations cannot alter the high-level semantic attributes, such as animal species present in a scene, to enhance the diversity of data. We address the lack of diversity in data augmentation with image-to-image transformations parameterized by pre-trained text-to-image diffusion models. Our method edits images to change their semantics using an off-the-shelf diffusion model, and generalizes to novel visual concepts from a few labelled examples. We evaluate our approach on few-shot image classification tasks, and on a real-world weed recognition task, and observe an improvement in accuracy in tested domains.
科研通智能强力驱动
Strongly Powered by AbleSci AI