A hybrid machine learning model with attention mechanism and multidimensional multivariate feature coding for essential gene prediction
Document type: Foreign-language journal article
First author: Wu, Yan
Authors: Wu, Yan; Li, Tan; Li, Mengshan; Xie, Xiaojun; Zhou, Weihong; Sheng, Sheng; Wang, Jun; Wu, Fu-an; Fu, Yu
Keywords: Essential gene; Machine learning; Attention mechanism; LSTM; CNN; Feature coding
Journal: BMC Biology (impact factor: 4.5; five-year impact factor: 5.4)
Year, volume, issue: 2025, vol. 23, no. 1
Indexed in: SCI
Abstract: Background: Essential genes are crucial for the development, inheritance, and survival of species. Exploring these genes can unravel complex mechanisms and fundamental life processes and identify potential therapeutic targets for various diseases, so the identification of essential genes is significant. Machine learning has become the mainstream approach for essential gene prediction. However, key challenges remain, such as the extraction of genetic features, the impact of imbalanced data, and cross-species generalization ability. Results: Here, we proposed a hybrid machine learning model, called EGP Hybrid-ML, based on graph convolutional neural networks (GCN) and bi-directional long short-term memory (Bi-LSTM) with an attention mechanism and multidimensional multivariate feature coding for essential gene prediction. In the model, GCN was used to extract feature encoding information from the visualized graphics of gene sequences, and the attention mechanism was combined with Bi-LSTM to assess the importance of each feature in gene sequences and to analyze the influences of different feature encoding methods and data imbalance. Additionally, the cross-species predictive performance of the model was evaluated through cross-validation. The results indicated that the sensitivity of the EGP Hybrid-ML model reached 0.9122. Conclusions: This model demonstrated superior predictive performance and strong generalization capabilities compared to other models. The EGP Hybrid-ML model proposed in this paper has broad application prospects in bioinformatics, chemical information, and pharmaceutical information. The code, architectures, parameters, and datasets of the proposed model are freely available on GitHub (https://github.com/gnnumsli/EGP-Hybrid-ML).
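Two ideas in the abstract can be shown with a small sketch: attention weighting over per-position features, and sensitivity as the reported metric. The code below is not taken from the EGP Hybrid-ML repository. The function names `attention_pool` and `sensitivity` are hypothetical, and the attention scores are passed in as plain numbers, whereas a real model would presumably learn them from the Bi-LSTM outputs.

```python
import math


def attention_pool(features, scores):
    """Softmax the per-position scores and return (weights, weighted sum of features).

    `features` is a list of equal-length vectors, one per sequence position.
    `scores` is a list of raw attention scores (assumed given here, not learned).
    """
    max_score = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - max_score) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(features[0])
    pooled = [sum(w * vec[i] for w, vec in zip(weights, features)) for i in range(dim)]
    return weights, pooled


def sensitivity(y_true, y_pred):
    """Sensitivity (recall) = TP / (TP + FN), where label 1 marks an essential gene."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp / (tp + fn) if (tp + fn) else 0.0
```

Sensitivity counts only how many truly essential genes are recovered, which makes it a natural metric to report when essential genes are the minority class, as the abstract's remark on imbalanced data suggests.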
- Related literature
Other papers by the authors
-
Inefficient C sequestration with long term high-level straw return as linked to protected C pools saturation on the North China Plain
Authors: Li, Xu; Yang, Xiaonan; Li, Jingyu; Fu, Xin; Peng, Zhengping; Wang, Jun; Dang, Hongkai
Keywords: Long-term straw return; Straw return rates; C labeling; Soil carbon saturation; Soil carbon sequestration
-
Global economic costs of invasions related to aquaculture: Addressing knowledge gaps and underestimated expenses
Authors: Jiang, Xiaoming; Zheng, Peng; Sun, Zhiwei; Wang, Jun; Ren, Lei; Soto, Ismael; Oficialdegui, Francisco J.; Haubrock, Phillip J.; Gu, Dangen; Ji, Lei
Keywords: Biological invasion; Economic impacts; Introduction pathway; Invasive species; Global assessment
-
In-situ structural modification on spinel oxide to achieve efficient removal of refractory organics: Triple optimisation of degradation performance
Authors: Chen, Yaoning; Zhou, Wencheng; Kang, Huayue; Zhao, Mengyang; Wang, Jun; Zhao, Chen; Zou, Bin; Jia, Xuyang; Li, Yuanping; Zhang, Wei; Liu, Yihuan
Keywords: Spinel oxide; Photocatalysis; Heterogeneous structures; Oxygen vacancies; Peroxymonosulfate
-
Identifying priority habitat for future spatial conservation and management decisions of East Asian finless porpoise within Miaodao Archipelago waters, China
Authors: Li, Yongtao; Cheng, Zhaolong; Zuo, Tao; Niu, Mingxiang; Wang, Jun; Wu, Zhongxun; Chu, Yongzhong
Keywords: East Asian finless porpoise; Core density area; Kernel density estimate; Spatial use; Cetacean conservation
-
Single-cell transcriptome atlas of lamprey exploring Natterin-induced white adipose tissue browning
Authors: Pang, Yue; Du, Zeyu; Zhang, Jin; Lu, Jiali; Li, Jun; Dong, Xinrui; Zhao, Zhisheng; Chuan, Shunqin; Sun, Mingjie; Li, Qingwei; Qin, Yating; Liu, Qun; Han, Kai; Yuan, Zengbao; Pan, Shanshan; Xu, Mengyang; Wang, Dantong; Li, Zhen; Chen, Yadong; Song, Yue; Zhan, Liping; Cui, Wei; Wang, Jun; Fan, Guangyi; Gu, Ying; Yang, Huanming; Xu, Xun; Liu, Xin; Li, Shuo; Zhang, Zhe; Ni, Ming; Jia, Xiaodong; Xia, Zhangyong; Yue, Zhen
-
Paenibacillus mesotrionivorans sp. nov., a Mesotrione-Degrading Strain Isolated from Soil
Authors: Song, Ye; Wu, Yan; Ruan, Luyao; Wan, Minglai; Liu, Bin; He, Jian; Chen, Leyao; Zhang, Baolong
-
Enzymatic synthesis of structured phospholipids with ideal w-6/3 fatty acid ratios via supplementation with α-linolenic acid from silkworm pupae oil
Authors: Tan, Lu; Wang, Jin-Zheng; Wang, Xin-Ying; Yan, Cheng-Hai; Qu, Ya-Xin; Huang, Ze-Lai; Bai, Tao; Wang, Jun; Qian, Jun-Feng
Keywords: Structured phospholipids; Silkworm pupae oil; alpha-linolenic acid; Functional foods