发布文献求助

Accurate Prediction of Antifreeze Protein from Sequences through Natural Language Text Processing and Interpretable Machine Learning Approaches

抗冻蛋白人工智能支持向量机计算机科学机器学习互补性（分子生物学）序列（生物学）相似性（几何）特征（语言学）模式识别（心理学）自然语言处理化学生物图像（数学）哲学生物化学遗传学语言学

作者

Saikat Dhibar,Biman Jana

出处

期刊：Journal of Physical Chemistry Letters [American Chemical Society]
日期：2023-11-27 卷期号：14 (48): 10727-10735 被引量：2

链接

标识

DOI：10.1021/acs.jpclett.3c02817

摘要

Antifreeze proteins (AFPs) bind to growing iceplanes owing to their structural complementarity nature, thereby inhibiting the ice-crystal growth by thermal hysteresis. Classification of AFPs from sequence is a difficult task due to their low sequence similarity, and therefore, the usual sequence similarity algorithms, like Blast and PSI-Blast, are not efficient. Here, a method combining n-gram feature vectors and machine learning models to accelerate the identification of potential AFPs from sequences is proposed. All these n-gram features are extracted from the K-mer counting method. The comparative analysis reveals that, among different machine learning models, Xgboost outperforms others in predicting AFPs from sequence when penta-mers are used as a feature vector. When tested on an independent dataset, our method performed better compared to other existing ones with sensitivity of 97.50%, recall of 98.30%, and f1 score of 99.10%. Further, we used the SHAP method, which provides important insight into the functional activity of AFPs.

求助该文献

最长约 10秒，即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

活动

『应助活动周』获奖名单已公布 🔥 (2025-4-2)

更新

『中科院2025期刊分区』已更新 (2025-3-23)

更新

『即时热点』模块已上线 (2025-2-28)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: 科研通AI5的应助被震动的凡柔采纳，获得10

刚刚; 充电宝的应助被未雨绸缪采纳，获得10

1秒前; yanyan发布了新的文献求助10

1秒前; Nzoth完成签到，获得积分10

3秒前; 甜蜜晓绿给甜蜜晓绿的求助进行了留言

3秒前; Ava上传了应助文件

4秒前; 英姑的应助被lizhiqian2024采纳，获得10

4秒前; 科研通AI2S上传了应助文件

4秒前; 可爱的函函上传了应助文件

5秒前; 大力捕发布了新的文献求助10

5秒前; Doki完成签到，获得积分10

5秒前; Tree完成签到，获得积分10

6秒前; 666pop完成签到，获得积分10

6秒前; 乐观短靴完成签到，获得积分10

6秒前; 晴万里完成签到，获得积分10

6秒前; 柠檬精翠翠完成签到，获得积分10

7秒前; 小蘑菇的应助被hello采纳，获得10

7秒前; 情怀的应助被42采纳，获得30

7秒前; cherish发布了新的文献求助10

8秒前; 吴彦祖发布了新的文献求助10

8秒前; 一颗药顽完成签到，获得积分10

10秒前; 听雨潇潇发布了新的文献求助10

11秒前; 玛卡巴卡完成签到，获得积分10

13秒前; 充电宝上传了应助文件

13秒前; 英俊的铭上传了应助文件

14秒前; Jasper上传了应助文件

14秒前; 科研通AI5上传了应助文件

16秒前; 科研通AI5的应助被程莉采纳，获得10

16秒前; ZhouYW的应助被李荣航采纳，获得10

16秒前; Landau发布了新的文献求助10

17秒前; keko完成签到，获得积分10

17秒前; 清脆雪糕发布了新的文献求助10

18秒前; hehe发布了新的文献求助10

18秒前; 旺旺小小贝完成签到，获得积分10

18秒前; 领导范儿上传了应助文件

20秒前; 情怀上传了应助文件

20秒前; 大力的汉堡完成签到，获得积分10

20秒前; 贪玩的采珊完成签到，获得积分10

21秒前; Zxy发布了新的文献求助10

21秒前; Landau完成签到，获得积分10

22秒前

高分求助中: Encyclopedia of Mathematical Physics 2nd edition 888; Technologies supporting mass customization of apparel: A pilot project 600; 材料概论周达飞 ppt 500; Nonrandom distribution of the endogenous retroviral regulatory elements HERV-K LTR on human chromosome 22 500; Introduction to Strong Mixing Conditions Volumes 1-3 500; Optical and electric properties of monocrystalline synthetic diamond irradiated by neutrons 320; 科学教育中的科学本质 300

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 3806839; 求助须知：如何正确求助？哪些是违规求助？ 3351563; 关于积分的说明 10354783; 捐赠科研通 3067340; 什么是DOI，文献DOI怎么找？ 1684500; 邀请新用户注册赠送积分活动 809737; 科研通“疑难数据库（出版商）”最低求助积分说明 765635

今日热心研友

昏睡的蟠桃

卡皮巴拉yuan

平常的毛豆

一颗西红柿

jenningseastera

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2025 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：941272744【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通