您好,欢迎访问广东省农业科学院 机构知识库!

A pangenome analysis pipeline provides insights into functional gene identification in rice

文献类型: 外文期刊

作者: Wang, Jian 1 ; Yang, Wu 1 ; Zhang, Shaohong 1 ; Hu, Haifei 1 ; Yuan, Yuxuan 5 ; Dong, Jingfang 1 ; Chen, Luo 1 ; Ma, Yamei 1 ; Yang, Tifeng 1 ; Zhou, Lian 1 ; Chen, Jiansong 1 ; Liu, Bin 1 ; Li, Chengdao 4 ; Edwards, David 7 ; Zhao, Junliang 1 ;

作者机构: 1.Guangdong Acad Agr Sci, Rice Res Inst, Guangzhou 510640, Peoples R China

2.Guangdong Acad Agr Sci, Guangdong Key Lab New Technol Rice Breeding, Guangzhou 510640, Peoples R China

3.Guangdong Acad Agr Sci, Guangdong Rice Engn Lab, Guangzhou 510640, Peoples R China

4.Murdoch Univ, Western Crop Genet Alliance, Murdoch, WA 6150, Australia

5.Chinese Univ Hong Kong, Sch Life Sci, Hong Kong, Peoples R China

6.Chinese Univ Hong Kong, State Key Lab Agrobiotechnol, Hong Kong, Peoples R China

7.Univ Western Australia, Sch Biol Sci, Perth, WA, Australia

8.Univ Western Australia, Ctr Appl Bioinformat, Perth, WA, Australia

关键词: Pangenome; Presence; absence variation; Genomic diversity; PAV-based GWAS

期刊名称:GENOME BIOLOGY ( 影响因子:12.3; 五年影响因子:17.4 )

ISSN: 1474-760X

年卷期: 2023 年 24 卷 1 期

页码:

收录情况: SCI

摘要: BackgroundA pangenome aims to capture the complete genetic diversity within a species and reduce bias in genetic analysis inherent in using a single reference genome. However, the current linear format of most plant pangenomes limits the presentation of position information for novel sequences. Graph pangenomes have been developed to overcome this limitation. However, bioinformatics analysis tools for graph format genomes are lacking.ResultsTo overcome this problem, we develop a novel strategy for pangenome construction and a downstream pangenome analysis pipeline (PSVCP) that captures genetic variants' position information while maintaining a linearized layout. Using PSVCP, we construct a high-quality rice pangenome using 12 representative rice genomes and analyze an international rice panel with 413 diverse accessions using the pangenome as the reference. We show that PSVCP successfully identifies causal structural variations for rice grain weight and plant height. Our results provide insights into rice population structure and genomic diversity. We characterize a new locus (qPH8-1) associated with plant height on chromosome 8 undetected by the SNP-based genome-wide association study (GWAS).ConclusionsOur results demonstrate that the pangenome constructed by our pipeline combined with a presence and absence variation-based GWAS can provide additional power for genomic and genetic analysis. The pangenome constructed in this study and the associated genome sequence and genetic variants data provide valuable genomic resources for rice genomics research and improvement in future.

  • 相关文献
作者其他论文 更多>>