High-quality genome assembly enables prediction of allele-specific gene expression in hybrid poplar

文献类型: 外文期刊

第一作者: Shi, Tian-Le

作者: Shi, Tian-Le;Jia, Kai-Hua;Bao, Yu-Tao;Tian, Xue-Chan;Yan, Xue-Mei;Chen, Zhao-Yang;Li, Zhi-Chao;Zhao, Shi-Wei;Ma, Hai-Yao;Zhao, Ye;An, Xinmin;Mao, Jian-Feng;Jia, Kai-Hua;Nie, Shuai;Nie, Shuai;Nie, Shuai;Li, Xiang;Zhang, Ren-Gang;Guo, Jing;Zhao, Wei;Wang, Xiao-Ru;El-Kassaby, Yousry Aly;Mueller, Niels;van de Peer, Yves;van de Peer, Yves;van de Peer, Yves;van de Peer, Yves;Street, Nathaniel Robert;Mao, Jian-Feng;Porth, Ilga

作者机构:

期刊名称:PLANT PHYSIOLOGY ( 影响因子:7.4; 五年影响因子:8.7 )

ISSN: 0032-0889

年卷期: 2024 年

页码:

收录情况: SCI

摘要: Poplar (Populus) is a well-established model system for tree genomics and molecular breeding, and hybrid poplar is widely used in forest plantations. However, distinguishing its diploid homologous chromosomes is difficult, complicating advanced functional studies on specific alleles. In this study, we applied a trio-binning design and PacBio high-fidelity long-read sequencing to obtain haplotype-phased telomere-to-telomere genome assemblies for the 2 parents of the well-studied F1 hybrid "84K" (Populus alba x Populus tremula var. glandulosa). Almost all chromosomes, including the telomeres and centromeres, were completely assembled for each haplotype subgenome apart from 2 small gaps on one chromosome. By incorporating information from these haplotype assemblies and extensive RNA-seq data, we analyzed gene expression patterns between the 2 subgenomes and alleles. Transcription bias at the subgenome level was not uncovered, but extensive-expression differences were detected between alleles. We developed machine-learning (ML) models to predict allele-specific expression (ASE) with high accuracy and identified underlying genome features most highly influencing ASE. One of our models with 15 predictor variables achieved 77% accuracy on the training set and 74% accuracy on the testing set. ML models identified gene body CHG methylation, sequence divergence, and transposon occupancy both upstream and downstream of alleles as important factors for ASE. Our haplotype-phased genome assemblies and ML strategy highlight an avenue for functional studies in Populus and provide additional tools for studying ASE and heterosis in hybrids.

分类号:

  • 相关文献
作者其他论文 更多>>