分割
人工智能
计算机科学
任务(项目管理)
自编码
尺度空间分割
概率逻辑
机器学习
集合(抽象数据类型)
基于分割的对象分类
生成模型
模式识别(心理学)
图像分割
生成语法
深度学习
经济
管理
程序设计语言
作者
Simon Kohl,Bernardino Romera‐Paredes,Clemens Meyer,Jeffrey De Fauw,Joseph R. Ledsam,Klaus Maier‐Hein,S. M. Ali Eslami,Danilo Jimenez Rezende,Olaf Ronneberger
出处
期刊:Cornell University - arXiv
日期:2018-01-01
被引量:233
标识
DOI:10.48550/arxiv.1806.05034
摘要
Many real-world vision problems suffer from inherent ambiguities. In clinical applications for example, it might not be clear from a CT scan alone which particular region is cancer tissue. Therefore a group of graders typically produces a set of diverse but plausible segmentations. We consider the task of learning a distribution over segmentations given an input. To this end we propose a generative segmentation model based on a combination of a U-Net with a conditional variational autoencoder that is capable of efficiently producing an unlimited number of plausible hypotheses. We show on a lung abnormalities segmentation task and on a Cityscapes segmentation task that our model reproduces the possible segmentation variants as well as the frequencies with which they occur, doing so significantly better than published approaches. These models could have a high impact in real-world applications, such as being used as clinical decision-making algorithms accounting for multiple plausible semantic segmentation hypotheses to provide possible diagnoses and recommend further actions to resolve the present ambiguities.
科研通智能强力驱动
Strongly Powered by AbleSci AI