Incorporating Gene Annotation into Genomic Prediction of Complex Phenotypes

Document type: Foreign-language journal

First author: Gao, Ning

Authors: Gao, Ning; Zhang, Zhe; Yuan, Xiaolong; Zhang, Hao; Li, Jiaqi; Martini, Johannes W. R.; Simianer, Henner

Author affiliations:

Keywords: genomic selection; gene annotation; categorical model; haplotype; GenPred; Shared Data Resources

Journal: GENETICS (impact factor: 4.562; five-year impact factor: 4.845)

ISSN: 0016-6731

Year, volume, issue: 2017, vol. 207, no. 2

Pages:

Indexed in: SCI

Abstract: Today, genomic prediction (GP) is an established technology in plant and animal breeding programs. Current standard methods are purely based on statistical considerations but do not make use of the abundant biological knowledge, which is easily available from public databases. Major questions that have to be answered before biological prior information can be used routinely in GP approaches are which types of information can be used, and at which points they can be incorporated into prediction methods. In this study, we propose a novel strategy to incorporate gene annotation into GP of complex phenotypes by defining haploblocks according to gene positions. Haplotype effects are then modeled as categorical or as numerical allele dosage variables. The underlying concept of this approach is to build the statistical model on variables representing the biologically functional units. We evaluate the new methods with data from a heterogeneous stock mouse population, the Drosophila Genetic Reference Panel (DGRP), and a rice breeding population from the Rice Diversity Panel. Our results show that using gene annotation to define haploblocks often leads to a comparable, but for some traits to a higher, predictive ability compared to SNP-based models or to haplotype models that do not use gene annotation information. Modeling gene interaction effects can further improve predictive ability. We also illustrate that the additional use of markers that have not been mapped to any gene in a second separate relatedness matrix does in many cases not lead to a relevant additional increase in predictive ability when the first matrix is based on haploblocks defined with gene annotation data, suggesting that intergenic markers only provide redundant information on the considered data sets. Therefore, gene annotation information seems to be appropriate to perceive the importance of DNA segments. Finally, we discuss the effects of gene annotation quality, marker density, and linkage disequilibrium on the performance of the new methods. To our knowledge, this is the first work that incorporates epistatic interaction or gene annotation into haplotype-based prediction approaches.
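Illustrative sketch (not part of the original record): the abstract describes defining haploblocks by gene positions and coding haplotype effects as numerical allele dosage variables. The Python sketch below shows one simplified way this idea could be expressed. It is not the authors' implementation. The function names `haploblock_dosage` and `relationship_matrix`, the input layout (phased 0/1 alleles), the gene intervals, and the centering and scaling of the relationship matrix are all assumptions made for illustration.

```python
import numpy as np

def haploblock_dosage(haps, positions, genes):
    """Count copies (0, 1 or 2) of each haplotype allele per gene-defined block.

    haps      : (n_ind, 2, n_snp) array of phased 0/1 alleles (assumed layout)
    positions : (n_snp,) array of marker positions in base pairs
    genes     : list of (start, end) intervals taken from a gene annotation
    """
    n_ind = haps.shape[0]
    columns = []
    for start, end in genes:
        mask = (positions >= start) & (positions <= end)
        if not mask.any():
            continue  # a gene without markers contributes no block
        block = haps[:, :, mask]                    # (n_ind, 2, n_block_snps)
        flat = block.reshape(-1, block.shape[2])    # one row per haplotype copy
        alleles, inverse = np.unique(flat, axis=0, return_inverse=True)
        inverse = inverse.reshape(n_ind, 2)         # allele index of each copy
        for a in range(len(alleles)):
            columns.append((inverse == a).sum(axis=1))
    return np.column_stack(columns).astype(float)

def relationship_matrix(Z):
    """Hypothetical scaling: centered cross-product divided by column count."""
    Zc = Z - Z.mean(axis=0)
    return Zc @ Zc.T / Z.shape[1]

# Toy data: 3 individuals, 4 markers, 2 hypothetical gene intervals.
haps = np.array([
    [[0, 0, 1, 1], [0, 0, 1, 1]],
    [[0, 0, 1, 1], [1, 1, 0, 0]],
    [[1, 1, 0, 0], [1, 1, 0, 0]],
])
positions = np.array([10, 20, 30, 40])
genes = [(5, 25), (28, 45)]

Z = haploblock_dosage(haps, positions, genes)
G = relationship_matrix(Z)
```

Each individual's row of `Z` sums to twice the number of blocks, because every block holds exactly two haplotype copies. Markers that fall outside all gene intervals are ignored here; the abstract notes that such intergenic markers could instead enter through a second, separate relatedness matrix.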

Classification number:
