Beyond Microsatellite Instability: Intrinsic Disorder as a Potential Link Between Protein Short Tandem Repeats and Cancer

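Keywords: Microsatellite; Microsatellite instability; Biology; Genetics; Cancer; Computational biology; Tandem repeat; Amino acid; Gene; Mutation; Genome; Allele
Authors
Max A. Verbiest, Matteo Delucchi, Tugce Bilgin Sonay, Maria Anisimova
Source
Journal: Frontiers in Bioinformatics [Frontiers Media SA]
Volume: 1 Citations: 4
Identifier
DOI: 10.3389/fbinf.2021.685844
Abstract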

Short tandem repeats (STRs) are abundant in genomic sequences and are known for comparatively high mutation rates; STRs are therefore thought to be a potent source of genetic diversity. In protein-coding sequences, STRs primarily encode disorder-promoting amino acids and are often located in intrinsically disordered regions (IDRs). STRs are frequently studied in the scope of microsatellite instability (MSI) in cancer, with little focus on the connection between protein STRs and IDRs. We believe, however, that this relationship should be explicitly included when ascertaining STR functionality in cancer. Here we explore this notion using all canonical human proteins from SwissProt, wherein we detected 3,699 STRs. Over 80% of these consisted completely of disorder-promoting amino acids. Of the amino acids in STR sequences, 62.1% were predicted to also be in an IDR, compared to 14.2% for non-repeat sequences. Over-representation analysis showed STR-containing proteins to be primarily located in the nucleus, where they perform protein- and nucleotide-binding functions and regulate gene expression. They were also enriched in cancer-related signaling pathways. Furthermore, we found enrichments of STR-containing proteins among those correlated with patient survival for cancers derived from eight different anatomical sites. Intriguingly, several of these cancer types are not known to have an MSI-high (MSI-H) phenotype, suggesting that protein STRs play a role in cancer pathology in non-MSI-H settings. Their intrinsic link with IDRs could therefore be an attractive topic of future research to further explore the role of STRs and IDRs in cancer. We speculate that our observations may be linked to the known dosage-sensitivity of disordered proteins, which could hint at a concentration-dependent gain-of-function mechanism in cancer for proteins containing STRs and IDRs.
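
The abstract does not state which statistical test was used for the over-representation analysis. A common choice for this kind of question is a one-sided hypergeometric test, so the sketch below assumes that test purely for illustration. The function name `hypergeom_enrichment_p` and every count in the example are hypothetical toy values, not figures from the paper.

```python
from math import comb


def hypergeom_enrichment_p(population: int, annotated: int,
                           sample: int, overlap: int) -> float:
    """Upper-tail probability P(X >= overlap) for a hypergeometric draw.

    population: total number of proteins considered (background)
    annotated:  background proteins carrying the annotation of interest
    sample:     proteins in the tested set (e.g. STR-containing proteins)
    overlap:    tested-set proteins that carry the annotation
    """
    if not (0 <= annotated <= population and 0 <= sample <= population):
        raise ValueError("annotated and sample must lie in [0, population]")

    start = max(overlap, 0)
    max_overlap = min(annotated, sample)

    # Sum the probability mass of every outcome at least as extreme as
    # the observed overlap. math.comb returns 0 when k > n, so outcomes
    # that are impossible for the given counts contribute nothing.
    tail = sum(
        comb(annotated, k) * comb(population - annotated, sample - k)
        for k in range(start, max_overlap + 1)
    )
    return tail / comb(population, sample)


# Toy example (hypothetical counts): 10 proteins in the background,
# 4 of them annotated, 3 proteins in the tested set, 2 of which are annotated.
p_value = hypergeom_enrichment_p(population=10, annotated=4, sample=3, overlap=2)
print(f"enrichment p-value: {p_value:.4f}")
```

A small p-value would indicate that the annotation appears in the tested set more often than expected from random sampling of the background. When many annotations are tested at once, the resulting p-values would normally also need a multiple-testing correction, which this sketch leaves out.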
