生物
蛋白质基因组学
打开阅读框
蛋白质组学
编码
计算生物学
遗传学
肽序列
基因组
基因
基因组学
作者
Mei Yang,Yuting Xie,Lingshuo Wang,Irwin Jungreis,Tong Ou,Manolis Kellis,Jia Wang,Yafeng Zhu
摘要
Abstract Small open reading frames (sORFs) encode an emerging class of functional proteins less than 100 amino acids in length. However, sORFs are incompletely characterized in mice and humans. The development of proteomics and Ribo-seq techniques has enabled the discovery of a number of sORF-encoded peptides (SEPs), but previous proteogenomics studies have been limited to a few cell lines or tissues. Given these limitations, a potentially vast number of sORFs remains to be discovered. We collected community-scale previously published proteomics data including one billion experimental spectra derived from a wide range of mouse and human tissues in order to identify novel sORFs and reveal the tissue expression status of novel and recently annotated sORF-encoded proteins. We have detected several novel sORFs in specific tissues, including a conserved protein-coding upstream overlapping ORF in HNRNPUL2 expressed in human lymphocytes, which may hold important biological functions. This work introduces a simple and efficient filtration strategy to detect novel sORFs. Our workflow will likely prove useful for future studies on sORFs in humans and other animals.
科研通智能强力驱动
Strongly Powered by AbleSci AI