A hybrid machine learning model with attention mechanism and multidimensional multivariate feature coding for essential gene prediction
文献类型: 外文期刊
第一作者: Wu, Yan
作者: Wu, Yan;Li, Tan;Li, Mengshan;Xie, Xiaojun;Wu, Yan;Zhou, Weihong;Sheng, Sheng;Wang, Jun;Wu, Fu-an;Wu, Yan;Zhou, Weihong;Sheng, Sheng;Wang, Jun;Wu, Fu-an;Fu, Yu;Li, Mengshan
作者机构:
关键词: Essential gene; Machine learning; Attention mechanism; LSTM; CNN; Feature coding
期刊名称:BMC BIOLOGY ( 影响因子:4.5; 五年影响因子:5.4 )
ISSN:
年卷期: 2025 年 23 卷 1 期
页码:
收录情况: SCI
摘要: BackgroundEssential genes are crucial for the development, inheritance, and survival of species. The exploration of these genes can unravel the complex mechanisms and fundamental life processes and identify potential therapeutic targets for various diseases. Therefore, the identification of essential genes is significant. Machine learning has become the mainstream approach for essential gene prediction. However, some key challenges in machine learning need to be addressed, such as the extraction of genetic features, the impact of imbalanced data, and the cross-species generalization ability.ResultsHere, we proposed a hybrid machine learning model based on graph convolutional neural networks (GCN) and bi-directional long short-term memory (Bi-LSTM) with attention mechanism and multidimensional multivariate feature coding for essential gene prediction, called EGP Hybrid-ML. In the model, GCN was used to extract feature encoding information from the visualized graphics of gene sequences and the attention mechanism was combined with Bi-LSTM to assess the importance of each feature in gene sequences and analyze the influences of different feature encoding methods and data imbalance. Additionally, the cross-species predictive performance of the model was evaluated through cross-validation. The results indicated that the sensitivity of the EGP Hybrid-ML model reached 0.9122.ConclusionsThis model demonstrated the superior predictive performance and strong generalization capabilities compared to other models. The EGP Hybrid-ML model proposed in this paper has broad application prospects in bioinformatics, chemical information, and pharmaceutical information. The codes, architectures, parameters, and datasets of the proposed model are available free of charge at GitHub (https://github.com/gnnumsli/EGP-Hybrid-ML).
分类号:
- 相关文献
作者其他论文 更多>>
-
Inefficient C sequestration with long term high-level straw return as linked to protected C pools saturation on the North China Plain
作者:Li, Xu;Yang, Xiaonan;Li, Jingyu;Fu, Xin;Peng, Zhengping;Fu, Xin;Peng, Zhengping;Wang, Jun;Dang, Hongkai
关键词:Long-term straw return; Straw return rates; C labeling; Soil carbon saturation; Soil carbon sequestration
-
Global economic costs of invasions related to aquaculture: Addressing knowledge gaps and underestimated expenses
作者:Jiang, Xiaoming;Zheng, Peng;Sun, Zhiwei;Wang, Jun;Ren, Lei;Soto, Ismael;Oficialdegui, Francisco J.;Haubrock, Phillip J.;Gu, Dangen;Haubrock, Phillip J.;Haubrock, Phillip J.;Oficialdegui, Francisco J.;Ji, Lei
关键词:Biological invasion; Economic impacts; Introduction pathway; Invasive species; Global assessment
-
Transcriptomic insight into the underlying mechanism of induced molting on reproductive remodeling, performance and egg quality in laying hen
作者:Ma, Pengyun;Chen, Jilan;Zhang, Xiaoke;Xu, Xinying;Ma, Zhong;Li, Yunlei;Ma, Pengyun;Xue, Fuguang;Zhang, Hao;Wu, Yan;Li, Ling;Qu, Yuanqi
关键词:Induced molting; Reproductive remodeling; Egg quality; Transcriptomic analysis; Laying hen
-
Invasion-Migration-Wear Mechanism of Hard Particles at the Interface of Water-Lubricated Rubber Bearing Under Friction Vibration Excitation
作者:Kuang, Fuming;He, Qing;Kuang, Fuming;Zhu, Anbang;Zhu, Dequan;Li, Qing;Zhou, Xincong;Yuan, Chengqing;Qin, Hongling;Cao, Pan;Wang, Jun
关键词:Water lubricated rubber bearing; Hard particle; Invasion-migration-wear mechanism; Frictional vibration
-
Transcriptome analysis and functional study of phospholipase A2 in Galleria mellonella larvae lipid metabolism in response to envenomation by an ectoparasitoid, Iseropus kuwanae
作者:Zhu, Hanqi;Liang, Xinhao;Ding, Jianhao;Wang, Jinzheng;Li, Ping;Zhou, Weihong;Wang, Jun;Wu, Fu-an;Sheng, Sheng;Li, Ping;Zhou, Weihong;Wang, Jun;Wu, Fu-an;Sheng, Sheng
关键词:ectoparasitoids; Galleria mellonella; Iseropus kuwanae; lipid metabolism; the greater wax moth; transcriptome; wasp development
-
TMT-Based quantitative proteomic analysis reveals age-related changes in eggshell matrix proteins and their correlation with eggshell quality in Xinyang blue-shelled laying hens
作者:Fu, Yu;Zhao, Dan-rong;Gao, Li-bing;Zhang, Hai-jun;Qi, Guang-hai;Wang, Jing;Zhao, Dan-rong;Feng, Jia;Min, Yu-na
关键词:Eggshell quality; Eggshell ultrastructure; Eggshell component; Matrix protein; Proteomics
-
Foisc1 regulates growth, conidiation, sensitivity to salicylic acid, and pathogenicity of Fusarium oxysporum f. sp. cubense tropical race 4
作者:Guo, Lijia;Wang, Jun;Zhou, You;Liang, Changcong;Liu, Lei;Yang, Yang;Huang, Junsheng;Yang, Laying;Guo, Lijia;Wang, Jun;Zhou, You;Liang, Changcong;Liu, Lei;Yang, Yang;Huang, Junsheng;Yang, Laying;Guo, Lijia;Wang, Jun;Zhou, You;Liang, Changcong;Liu, Lei;Yang, Yang;Huang, Junsheng;Yang, Laying;Guo, Lijia;Wang, Jun;Zhou, You;Liang, Changcong;Liu, Lei;Yang, Yang;Huang, Junsheng;Yang, Laying
关键词:Vascular wilt of banana; Isochorismatase; Pathogenicity; Salicylic acid; Defense response; Fusaric acid biosynthesis