SVLearn: a dual-reference machine learning approach enables accurate cross-species genotyping of structural variants
文献类型: 外文期刊
第一作者: Yang, Qimeng
作者: Yang, Qimeng;Wang, Xinyu;Wang, Jiong;Bian, Peipei;Dai, Xuelei;Gong, Mian;Zhang, Zhuangbiao;Wang, Ao;Bai, Fengting;Li, Ran;Cai, Yudong;Jiang, Yu;Sun, Jianfeng;Liu, Quanzhong;Zhang, Xin;Wang, Sizhe;Hao, Ran;Ru, Jinlong;Dai, Xuelei;Gong, Mian
作者机构:
期刊名称:NATURE COMMUNICATIONS ( 影响因子:15.7; 五年影响因子:17.2 )
ISSN:
年卷期: 2025 年 16 卷 1 期
页码:
收录情况: SCI
摘要: Structural variations (SVs) are diverse forms of genetic alterations and drive a wide range of human diseases. Accurately genotyping SVs, particularly occurring at repetitive genomic regions, from short-read sequencing data remains challenging. Here, we introduce SVLearn, a machine-learning approach for genotyping bi-allelic SVs. It exploits a dual-reference strategy to engineer a curated set of genomic, alignment, and genotyping features based on a reference genome in concert with an allele-based alternative genome. Using 38,613 human-derived SVs, we show that SVLearn significantly outperforms four state-of-the-art tools, with precision improvements of up to 15.61% for insertions and 13.75% for deletions in repetitive regions. On two additional sets of 121,435 cattle SVs and 113,042 sheep SVs, SVLearn demonstrates a strong generalizability to cross-species genotype SVs with a weighted genotype concordance score of up to 90%. Notably, SVLearn enables accurate genotyping of SVs at low sequencing coverage, which is comparable to the accuracy at 30x coverage. Our studies suggest that SVLearn can accelerate the understanding of associations between the genome-scale, high-quality genotyped SVs and diseases across multiple species.
分类号:
- 相关文献
作者其他论文 更多>>
-
Pathogenicity of Bipolaris oryzae isolated from banana to different plant species
作者:Zhang, Zhaojing;Zhao, Hong;Zhang, Zhaojing;Zhao, Hong;Ouyang, Yanfei;Zhang, Xin;Qi, Yanxiang;Ouyang, Yanfei
关键词:
Bipolaris oryzae ; Host range; Identification; In vitro inoculation -
Effects of different raw materials on bacterial community and flavor compounds in fermented red sour soup
作者:Zhang, Xin;Du, Ming;Wang, Zhenyu;Zhang, Xin;Ye, Shuhong;Du, Ming;Wang, Zhenyu;Xiong, Kexin;Xiong, Kexin
关键词:Red sour soup; Fermentation; Flavor substances; High-throughput sequencing; Flora structure; Lack of raw materials
-
Estimation Model for Cotton Canopy Structure Parameters Based on Spectral Vegetation Index
作者:Qi, Yaqin;Chen, Xi;Chen, Zhengchao;Zhang, Hao;Qi, Yaqin;Zhang, Xin;Shen, Congju;Chen, Bing;Wang, Qiong;Liu, Taijie;Qi, Yaqin;Chen, Yan;Peng, Yuanying
关键词:spectral vegetation index; cotton canopy information; estimation model
-
Rapid adaptation of Bacillus thuringiensis to alkaline environments via the L-lactate metabolism pathway regulated by the CRP/FNR family regulator LtmR
作者:Peng, Qi;Qin, Jiaxin;Xu, Hong;Kao, Guiwei;Yang, Fan;Sun, Zhongqin;Zhang, Xin;Song, Fuping;Qin, Jiaxin;Sun, Zhongqin;Guo, Shuyuan;Peng, Qi;Slamti, Leyla
关键词:Bacillus thuringiensis; CRP/FNR family regulator; Alkaline adaptation
-
Evaluation of the strategy for insecticidal crystal encapsulation with cell wall in industrial processes
作者:Hou, Yujia;Zhang, Xin;Xu, Yanrong;Zhu, Yuanhong;Yan, Tinglu;Peng, Qi;Song, Fuping;Zhang, Xin;Zhou, Li;Liu, Huamei
关键词:
Bacillus thuringiensis ; industrial processes; mother cell lysis; insecticidal crystal encapsulation; UV resistance -
OsPAD1, encoding a non-specific lipid transfer protein, is required for rice pollen aperture formation
作者:Wang, Qiming;Tian, Yunlu;Chen, Keyi;Wang, Chaolong;Yu, Xiaowen;Bai, Wenting;Zheng, Hai;You, Shimin;Hu, Yang;Lei, Dekun;Jian, Anqi;Lu, Jiayu;Yu, Hao;Jiang, Ling;Zhao, Zhigang;Wan, Jianmin;Zhu, Shanshan;Xiong, Yehui;Zhang, Xin;Ren, Yulong;Lei, Cailin;Cheng, Zhijun;Lin, Qibing;Wan, Jianmin
关键词:Sterile; Fibrillar-granular layer; Operculum; Lipid binding;
Oriza sativa -
Magnetic graphene-enhanced exonuclease III assisted amplification strategy driven carbon nanozyme for tri-mode detection of Escherichia coli O157:H7
作者:Ren, Yuwei;Cao, Lulu;Li, Zongyang;Zhang, Xin;Li, Hui;Jiao, Rui;Yu, Xiaoyu;Liang, Zhao;Li, Ganghui;Xia, Xiyang;Ling, Na;Zhang, Danfeng;Ye, Yingwang;Cheng, Jianghua
关键词:Tri-mode detection; Escherichia coli O157:H7; Cu-cl co-doped nanozyme; Cascade signal amplification; Deep learning