亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

Vision Transformer for Contrastive Clustering

计算机科学 特征学习 聚类分析 人工智能 模式识别(心理学) 深度学习 卷积神经网络 自编码
作者
Hua-Bao Ling,Bin Zhu,Hui Dong,Ding-Hua Chen,Chang‐Dong Wang,Jianhuang Lai
出处
期刊:Cornell University - arXiv
标识
DOI:10.48550/arxiv.2206.12925
摘要

Vision Transformer (ViT) has shown its advantages over the convolutional neural network (CNN) with its ability to capture global long-range dependencies for visual representation learning. Besides ViT, contrastive learning is another popular research topic recently. While previous contrastive learning works are mostly based on CNNs, some recent studies have attempted to combine ViT and contrastive learning for enhanced self-supervised learning. Despite the considerable progress, these combinations of ViT and contrastive learning mostly focus on the instance-level contrastiveness, which often overlook the global contrastiveness and also lack the ability to directly learn the clustering result (e.g., for images). In view of this, this paper presents a novel deep clustering approach termed Vision Transformer for Contrastive Clustering (VTCC), which for the first time, to our knowledge, unifies the Transformer and the contrastive learning for the image clustering task. Specifically, with two random augmentations performed on each image, we utilize a ViT encoder with two weight-sharing views as the backbone. To remedy the potential instability of the ViT, we incorporate a convolutional stem to split each augmented sample into a sequence of patches, which uses multiple stacked small convolutions instead of a big convolution in the patch projection layer. By learning the feature representations for the sequences of patches via the backbone, an instance projector and a cluster projector are further utilized to perform the instance-level contrastive learning and the global clustering structure learning, respectively. Experiments on eight image datasets demonstrate the stability (during the training-from-scratch) and the superiority (in clustering performance) of our VTCC approach over the state-of-the-art.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
学生信的大叔完成签到,获得积分10
1秒前
7秒前
江流儿完成签到,获得积分10
15秒前
故然完成签到 ,获得积分10
17秒前
niiiii发布了新的文献求助20
26秒前
33秒前
Jayzie完成签到 ,获得积分10
35秒前
35秒前
CipherSage应助niiiii采纳,获得10
38秒前
聪明大米发布了新的文献求助10
39秒前
朱敛发布了新的文献求助10
40秒前
酷波er应助neko采纳,获得10
43秒前
46秒前
陳.发布了新的文献求助10
51秒前
摇一摇完成签到,获得积分10
53秒前
E上电_GWJ完成签到,获得积分10
58秒前
hodi完成签到,获得积分10
1分钟前
1分钟前
1分钟前
李健的粉丝团团长应助hin采纳,获得10
1分钟前
ZZQ完成签到 ,获得积分10
1分钟前
欣嫩谷发布了新的文献求助10
1分钟前
1分钟前
orixero应助科研通管家采纳,获得10
1分钟前
熊猫应助科研通管家采纳,获得10
1分钟前
爆米花应助聪明大米采纳,获得10
1分钟前
英姑应助欣嫩谷采纳,获得30
1分钟前
1分钟前
1分钟前
neko发布了新的文献求助10
1分钟前
在水一方应助一一采纳,获得10
1分钟前
Cik发布了新的文献求助10
1分钟前
1分钟前
圆圆发布了新的文献求助10
1分钟前
TXZ06完成签到,获得积分10
1分钟前
Benhnhk21完成签到,获得积分10
1分钟前
英姑应助圆圆采纳,获得10
1分钟前
拿铁小笼包完成签到,获得积分10
1分钟前
爱学习的YY完成签到 ,获得积分10
1分钟前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
晶种分解过程与铝酸钠溶液混合强度关系的探讨 8888
Chemistry and Physics of Carbon Volume 18 800
The Organometallic Chemistry of the Transition Metals 800
Leading Academic-Practice Partnerships in Nursing and Healthcare: A Paradigm for Change 800
The formation of Australian attitudes towards China, 1918-1941 640
Signals, Systems, and Signal Processing 610
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6425991
求助须知:如何正确求助?哪些是违规求助? 8243535
关于积分的说明 17526742
捐赠科研通 5480763
什么是DOI,文献DOI怎么找? 2894427
邀请新用户注册赠送积分活动 1870511
关于科研通互助平台的介绍 1708684