发布文献求助

scELMo: Embeddings from Language Models are Good Learners for Single-cell Data Analysis

计算机科学元数据语言模型嵌入原始数据聚类分析注释功能（生物学）发电机（电路理论）领域（数学分析）人工智能自然语言处理数据挖掘机器学习程序设计语言万维网功率（物理）物理数学分析生物进化生物学量子力学数学

作者

Tian-Yu Liu,Tianqi Chen,Wangjie Zheng,Xiao Luo,Hongyu Zhao

链接

researchsquare.comdoi.org

标识

DOI：10.1101/2023.12.07.569910

摘要

Abstract Various Foundation Models (FMs) have been built based on the pre-training and fine-tuning framework to analyze single-cell data with different degrees of success. In this manuscript, we propose a method named scELMo (Single-cell Embedding from Language Models), to analyze single-cell data that utilizes Large Language Models (LLMs) as a generator for both the description of metadata information and the embeddings for such descriptions. We combine the embeddings from LLMs with the raw data under the zero-shot learning framework to further extend its function by using the fine-tuning framework to handle different tasks. We demonstrate that scELMo is capable of cell clustering, batch effect correction, and cell-type annotation without training a new model. Moreover, the fine-tuning framework of scELMo can help with more challenging tasks including in-silico treatment analysis or modeling perturbation. scELMo has a lighter structure and lower requirements for resources. Our method also outperforms recent large-scale FMs (such as scGPT [1], Geneformer [2]) and other LLM-based single-cell data analysis pipelines (such as GenePT [3] and GPTCelltype [4]) based on our evaluations, suggesting a promising path for developing domain-specific FMs.

求助该文献

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

活动

『应助活动周』获奖名单已公布 🔥 (2025-4-2)

更新

『中科院2025期刊分区』已更新 (2025-3-23)

更新

『即时热点』模块已上线 (2025-2-28)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: cjh258819发布了新的文献求助10

1秒前; cyy发布了新的文献求助10

2秒前; 科研通AI5上传了应助文件

2秒前; DIngqin发布了新的文献求助10

3秒前; 苗修杰完成签到，获得积分10

3秒前; 小艾艾麦仑完成签到，获得积分10

6秒前; 酷波er的应助被皮城小伙采纳，获得10

7秒前; 嘻嘻嘻发布了新的文献求助10

8秒前; 研友_VZG7GZ上传了应助文件

8秒前; 健壮念寒完成签到，获得积分10

9秒前; 赘婿的应助被克里斯就是逊啦采纳，获得10

10秒前; DIngqin完成签到，获得积分10

11秒前; Lucas上传了应助文件

11秒前; 余味的应助被贪玩的书雪采纳，获得10

12秒前; 小二郎的应助被SUNun采纳，获得10

12秒前; 科研通AI5上传了应助文件

13秒前; 小蘑菇上传了应助文件

14秒前; 玄叶完成签到，获得积分10

14秒前; 复杂的晓蕾发布了新的文献求助10

14秒前; Hello的应助被xxyhh采纳，获得10

15秒前; 菜籽发布了新的文献求助10

17秒前; Yunnnnn发布了新的文献求助10

18秒前; july13发布了新的文献求助10

18秒前; jenningseastera上传了应助文件

18秒前; 熬夜猫完成签到，获得积分10

18秒前; 科研通AI5的应助被SWD采纳，获得10

19秒前; HEIKU上传了应助文件

20秒前; 笨笨的鼠鼠完成签到，获得积分20

22秒前; 科研通AI5的应助被奋斗含巧采纳，获得10

22秒前; 汉堡包的应助被Brave采纳，获得10

22秒前; xueyi_102938发布了新的文献求助10

23秒前; 桐桐上传了应助文件

25秒前; 深情安青的应助被嘻嘻嘻采纳，获得10

28秒前; 天公不作美完成签到，获得积分10

30秒前; 善学以致用的应助被dfhh采纳，获得10

30秒前; 上官若男的应助被upsoar采纳，获得10

30秒前; 一颗橙子发布了新的文献求助10

31秒前; 北北完成签到，获得积分10

31秒前; 精明的代萱完成签到，获得积分10

31秒前; 科研通AI5上传了应助文件

33秒前

高分求助中: 【此为提示信息，请勿应助】请按要求发布求助，避免被关 20000; Technologies supporting mass customization of apparel: A pilot project 450; Mixing the elements of mass customisation 360; Периодизация спортивной тренировки. Общая теория и её практическое применение 310; the MD Anderson Surgical Oncology Manual, Seventh Edition 300; Nucleophilic substitution in azasydnone-modified dinitroanisoles 300; Political Ideologies Their Origins and Impact 13th Edition 260

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 3781669; 求助须知：如何正确求助？哪些是违规求助？ 3327234; 关于积分的说明 10230111; 捐赠科研通 3042093; 什么是DOI，文献DOI怎么找？ 1669791; 邀请新用户注册赠送积分活动 799335; 科研通“疑难数据库（出版商）”最低求助积分说明 758774

今日热心研友

jenningseastera

平常的毛豆

忐忑的黑猫

科研小民工

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2025 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：941272744【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通