发布文献求助

Learned representation-guided diffusion models for large-image generation

计算机科学人工智能稳健性（进化）忠诚概化理论编码模式识别（心理学）分类器（UML）图像（数学）数学电信生物化学化学统计基因

作者

Alexandros Graikos,Srikar Yellapragada,Minh-Quan Le,Saarthak Kapse,Prateek Prasanna,Joel Saltz,Dimitris Samaras

出处

期刊：Cornell University - arXiv 日期：2023-01-01

链接

arxiv.org datacite.orgdoi.org

标识

DOI：10.48550/arxiv.2312.07330

摘要

To synthesize high-fidelity samples, diffusion models typically require auxiliary data to guide the generation process. However, it is impractical to procure the painstaking patch-level annotation effort required in specialized domains like histopathology and satellite imagery; it is often performed by domain experts and involves hundreds of millions of patches. Modern-day self-supervised learning (SSL) representations encode rich semantic and visual information. In this paper, we posit that such representations are expressive enough to act as proxies to fine-grained human labels. We introduce a novel approach that trains diffusion models conditioned on embeddings from SSL. Our diffusion models successfully project these features back to high-quality histopathology and remote sensing images. In addition, we construct larger images by assembling spatially consistent patches inferred from SSL embeddings, preserving long-range dependencies. Augmenting real data by generating variations of real images improves downstream classifier accuracy for patch-level and larger, image-scale classification tasks. Our models are effective even on datasets not encountered during training, demonstrating their robustness and generalizability. Generating images from learned embeddings is agnostic to the source of the embeddings. The SSL embeddings used to generate a large image can either be extracted from a reference image, or sampled from an auxiliary model conditioned on any related modality (e.g. class labels, text, genomic data). As proof of concept, we introduce the text-to-large image synthesis paradigm where we successfully synthesize large pathology and satellite images out of text descriptions.

求助该文献

最长约 10秒，即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 论文查重

更新

大幅提高文件上传限制，最高150M (2024-4-1)

更新

新增期刊收藏功能 (2024-03-23)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: 深情安青上传了应助文件

刚刚; NexusExplorer的应助被lucy采纳，获得10

刚刚; 曾雪玲完成签到，获得积分20

1秒前; 徐三问完成签到，获得积分10

1秒前; 彭洪泽发布了新的文献求助10

1秒前; 小溪发布了新的文献求助10

1秒前; Young_Lee发布了新的文献求助10

2秒前; 科目三上传了应助文件

2秒前; 田様上传了应助文件

3秒前; 123发布了新的文献求助10

3秒前; 打打的应助被CC采纳，获得20

3秒前; 徐三问发布了新的文献求助10

3秒前; 笑点低蜜蜂完成签到，获得积分10

4秒前; sss312发布了新的文献求助10

4秒前; 赘婿上传了应助文件

4秒前; 由哎完成签到，获得积分10

4秒前; 大模型的应助被小赵不摸鱼采纳，获得10

4秒前; 倒背如流圆周率完成签到，获得积分0

4秒前; 小周棒棒哒发布了新的文献求助10

5秒前; Rr完成签到，获得积分10

5秒前; 安详书蝶完成签到，获得积分20

5秒前; 乐乐的应助被平常雨泽采纳，获得10

7秒前; 小马甲上传了应助文件

7秒前; adsx发布了新的文献求助10

7秒前; 美满听白发布了新的文献求助10

8秒前; 领导范儿的应助被淳于黎昕采纳，获得10

8秒前; 一条淡水鱼上传了应助文件

8秒前; 从容不可发布了新的文献求助10

8秒前; 悟小空发布了新的文献求助10

9秒前; Akim的应助被小周棒棒哒采纳，获得10

10秒前; Ata上传了应助文件

10秒前; 下北泽完成签到，获得积分10

11秒前; jiayourui上传了应助文件

11秒前; ssdddq发布了新的文献求助10

11秒前; 希望天下0贩的0上传了应助文件

11秒前; alltoowell完成签到，获得积分0

12秒前; shangxinyu完成签到，获得积分10

12秒前; 乐乐乐发布了新的文献求助10

12秒前; Oliver发布了新的文献求助10

12秒前; 阳光青烟完成签到，获得积分10

12秒前

高分求助中: 请在求助之前详细阅读求助说明！！！！ 20000; One Man Talking: Selected Essays of Shao Xunmei, 1929–1939 1000; The Three Stars Each: The Astrolabes and Related Texts 900; Yuwu Song, Biographical Dictionary of the People's Republic of China 800; Multifunctional Agriculture, A New Paradigm for European Agriculture and Rural Development 600; Bernd Ziesemer - Maos deutscher Topagent: Wie China die Bundesrepublik eroberte 500; A radiographic standard of reference for the growing knee 400

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 2479441; 求助须知：如何正确求助？哪些是违规求助？ 2141958; 关于积分的说明 5461484; 捐赠科研通 1865041; 什么是DOI，文献DOI怎么找？ 927124; 版权声明 562922; 科研通“疑难数据库（出版商）”最低求助积分说明 496074

今日热心研友

个性的紫菜

一条淡水鱼

坚强的广山

寻寻觅觅呢

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2024 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：826996720【点击一键加群】如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通