CodonBERT: a BERT-based architecture tailored for codon optimization using the cross-attention mechanism
文献类型: 外文期刊
第一作者: Ren, Zilin
作者: Ren, Zilin;Jiang, Lili;Di, Yaxin;Zhang, Dufei;Jiang, Qiwei;Zhou, Bo;Ren, Zilin;Jiang, Lili;Zhang, Dufei;Fu, Zhiguo;Sun, Pingping;Di, Yaxin;Gong, Jianli;Gong, Jianting;Ni, Ming
作者机构:
期刊名称:BIOINFORMATICS ( 影响因子:4.4; 五年影响因子:7.6 )
ISSN: 1367-4803
年卷期: 2024 年 40 卷 7 期
页码:
收录情况: SCI
摘要: Motivation Due to the varying delivery methods of mRNA vaccines, codon optimization plays a critical role in vaccine design to improve the stability and expression of proteins in specific tissues. Considering the many-to-one relationship between synonymous codons and amino acids, the number of mRNA sequences encoding the same amino acid sequence could be enormous. Finding stable and highly expressed mRNA sequences from the vast sequence space using in silico methods can generally be viewed as a path-search problem or a machine translation problem. However, current deep learning-based methods inspired by machine translation may have some limitations, such as recurrent neural networks, which have a weak ability to capture the long-term dependencies of codon preferences.Results We develop a BERT-based architecture that uses the cross-attention mechanism for codon optimization. In CodonBERT, the codon sequence is randomly masked with each codon serving as a key and a value. In the meantime, the amino acid sequence is used as the query. CodonBERT was trained on high-expression transcripts from Human Protein Atlas mixed with different proportions of high codon adaptation index codon sequences. The result showed that CodonBERT can effectively capture the long-term dependencies between codons and amino acids, suggesting that it can be used as a customized training framework for specific optimization targets.Availability and implementation CodonBERT is freely available on https://github.com/FPPGroup/CodonBERT.
分类号:
- 相关文献
作者其他论文 更多>>
-
DeepPFP: a multi-task-aware architecture for protein function prediction
作者:Bo, Xiaochen;Xue, Jiguo;Ni, Ming;Wang, Han;Sun, Jinghong;Gao, Jingyang;Ren, Zilin;Chen, Yongbing;Ren, Zilin;Chen, Yongbing
关键词:protein function prediction; SARS-CoV-2; deep learning; meta learning
-
Single-cell transcriptome atlas of lamprey exploring Natterin- induced white adipose tissue browning
作者:Pang, Yue;Du, Zeyu;Zhang, Jin;Lu, Jiali;Li, Jun;Dong, Xinrui;Zhao, Zhisheng;Chuan, Shunqin;Sun, Mingjie;Li, Qingwei;Pang, Yue;Du, Zeyu;Zhang, Jin;Lu, Jiali;Li, Jun;Dong, Xinrui;Zhao, Zhisheng;Chuan, Shunqin;Sun, Mingjie;Li, Qingwei;Qin, Yating;Liu, Qun;Han, Kai;Yuan, Zengbao;Pan, Shanshan;Xu, Mengyang;Wang, Dantong;Li, Zhen;Chen, Yadong;Song, Yue;Zhan, Liping;Cui, Wei;Wang, Jun;Fan, Guangyi;Qin, Yating;Liu, Qun;Han, Kai;Fan, Guangyi;Qin, Yating;Liu, Qun;Han, Kai;Song, Yue;Qin, Yating;Fan, Guangyi;Yuan, Zengbao;Xu, Mengyang;Wang, Dantong;Gu, Ying;Yang, Huanming;Xu, Xun;Liu, Xin;Fan, Guangyi;Xu, Mengyang;Fan, Guangyi;Li, Shuo;Zhang, Zhe;Ni, Ming;Jia, Xiaodong;Xia, Zhangyong;Yue, Zhen;Fan, Guangyi;Gu, Ying;Yang, Huanming;Xu, Xun;Liu, Xin
关键词:
-
Oocytes maintain low ROS levels to support the dormancy of primordial follicles
作者:Qin, Shaogang;Chi, Xinyue;Zhu, Zijian;Gao, Meng;Zhao, Ting;Zhang, Jingwen;Zheng, Wenying;Chen, Ziqi;Zhou, Bo;Xia, Guoliang;Wang, Chao;Chen, Chuanhe;Zhang, Tuo;He, Meina;Zhang, Lifan;Wang, Wenji
关键词:ferroptosis; primary ovarian insufficiency; primordial follicle; ROS; SOD1
-
Rapid sequencing and identification for 18-STRs long amplicon panel using portable devices and nanopore sequencer
作者:Zhang, Jiarong;Yang, Tingting;Shi, Linyu;Yan, Jiang-wei;Ni, Ming;Zhang, Jiarong;Yang, Tingting;Xie, Zihan;Yan, Jiang-wei;Ni, Ming;Zhang, Jiarong;Yang, Tingting;Shi, Linyu;Yan, Jiang-wei;Xie, Zihan;Ren, Zilin;Ren, Zilin;Ren, Zilin
关键词:Short tandem repeat (STR); Long-amplicon; Multiplex amplification; Rapid human identification; Nanopore sequencing; Portable devices
-
An opensource indoor climate and yield prediction model for Chinese solar greenhouses
作者:Zhou, Bo;Wang, Nan;Yang, Qichang;Zhou, Bo;Wang, Nan;Yang, Qichang;Zhou, Bo;Lastiri, Daniel Reyes;van Henten, Eldert J.
关键词:Chinese solar greenhouse; Climate model; Tomato yield prediction; Sensitivity analysis
-
Clinical manifestations and pathogenicity of Clade IIb monkeypox virus in rabbits
作者:Shang, Chao;Jiang, Qiwei;Wang, Xiaohan;Sun, Yongyang;Hu, Jinglei;Zhang, Cuiling;Liu, Zirui;Gu, Chaode;Liu, Yan;Zhao, Zongzheng;Li, Xiao;Shi, Shaowen;Shi, Wanyu;Yao, Xiaohong;Li, Wanzi;Shi, Shaowen;Shi, Shaowen;Song, Gaojie;Li, Yiquan;Zhu, Yilong
关键词:Monkeypox; animal model; rabbit; pathogenicity; clade IIb variant
-
Bacillus paralicheniformis SYN-191 isolated from ginger rhizosphere soil and its growth-promoting effects in ginger farming
作者:Sun, Yanan;Liu, Kai;Liu, Yayu;Yang, Xuerong;Du, Binghai;Li, Xiang;Zhou, Bo;Wang, Chengqiang;Liu, Zhongliang;Li, Ningyang;Li, Ningyang;Zhu, Xueming;Wang, Hailong;Peng, Bingyin
关键词:Ginger; Continuous cropping; Bacillus paralicheniformis; Plant-microbial interaction; Whole genome analysis