Fast genomic prediction of breeding values using parallel Markov chain Monte Carlo with convergence diagnosis

Document type: Foreign-language journal

First author: Guo, Peng

Authors: Guo, Peng; Zhu, Bo; Niu, Hong; Wang, Zezhao; Liang, Yonghu; Chen, Yan; Zhang, Lupei; Gao, Xue; Gao, Huijiang; Xu, Lingyang; Li, Junya; Ni, Hemin; Guo, Yong; Hay, El Hamidi A.; Wu, Xiaolin

Author affiliations:

Keywords: Bayesian models; Convergence diagnosis; Genomic prediction; High-performance computing; Tunable burn-in

Journal: BMC Bioinformatics (impact factor: 3.169; five-year impact factor: 3.629)

ISSN: 1471-2105

Year/volume/issue: 2018, vol. 19

Pages:

Indexed in: SCI

Abstract: Background: Running multiple-chain Markov chain Monte Carlo (MCMC) provides an efficient parallel computing method for complex Bayesian models, although the efficiency of the approach depends critically on the length of the non-parallelizable burn-in period, during which all simulated data are discarded. In practice, this burn-in period is set arbitrarily and often results in far more iterations than required. In addition, the accuracy of genomic predictions does not improve after the MCMC reaches equilibrium.
Results: Automatic tuning of the burn-in length for running multiple-chain MCMC was proposed in the context of genomic predictions using BayesA and BayesCπ models. The performance of parallel computing versus sequential computing, and of tunable burn-in MCMC versus fixed burn-in MCMC, was assessed using simulation data sets as well as by applying these methods to genomic predictions of a Chinese Simmental beef cattle population. The results showed that tunable burn-in parallel MCMC had greater speedups than fixed burn-in parallel MCMC, and both had greater speedups relative to sequential (single-chain) MCMC. Nevertheless, genomic estimated breeding values (GEBVs) and genomic prediction accuracies were highly comparable between the various computing approaches. When applied to the genomic predictions of four quantitative traits in a Chinese Simmental population of 1217 beef cattle genotyped by an Illumina Bovine 770K SNP BeadChip, tunable burn-in multiple-chain BayesCπ (TBM-BayesCπ) outperformed tunable burn-in multiple-chain BayesA (TBM-BayesA) and Genomic Best Linear Unbiased Prediction (GBLUP) in terms of prediction accuracy, although the differences were not necessarily caused by computational factors and could have been intrinsic to the statistical models per se.
Conclusions: Automatically tunable burn-in multiple-chain MCMC provides an accurate and cost-effective tool for high-performance computing of Bayesian genomic prediction models, and this algorithm is generally applicable to high-performance computing of any complex Bayesian statistical model.
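
Illustrative note: the abstract does not state which convergence diagnostic or sampler settings the authors use, so the following is only a minimal sketch of the general idea, under stated assumptions. It assumes a Gelman-Rubin style potential scale reduction factor as the diagnostic, a toy standard-normal target in place of a BayesA or BayesCπ model, and a simple cost model in which the burn-in is paid in full by every chain while the retained samples are split evenly across chains. All function names, thresholds, and iteration counts are hypothetical.

```python
import math
import random
import statistics


def gelman_rubin(chains):
    """Potential scale reduction factor for equal-length chains of scalars."""
    n = len(chains[0])
    means = [statistics.fmean(c) for c in chains]
    within = statistics.fmean([statistics.variance(c) for c in chains])
    if within == 0.0:
        return math.inf  # no chain has moved yet, so nothing can be diagnosed
    between = n * statistics.variance(means)
    pooled = (n - 1) / n * within + between / n
    return math.sqrt(pooled / within)


def metropolis_step(x, rng, step=1.0):
    """One random-walk Metropolis update targeting a standard normal (toy target)."""
    proposal = x + rng.gauss(0.0, step)
    log_ratio = 0.5 * (x * x - proposal * proposal)
    if rng.random() < math.exp(min(0.0, log_ratio)):
        return proposal
    return x


def tune_burn_in(n_chains=4, check_every=50, threshold=1.1, max_iter=5000, seed=1):
    """Run chains from dispersed starts; stop burn-in once the diagnostic passes."""
    rng = random.Random(seed)
    states = [rng.uniform(-10.0, 10.0) for _ in range(n_chains)]
    history = [[] for _ in range(n_chains)]
    for it in range(1, max_iter + 1):
        for k in range(n_chains):
            states[k] = metropolis_step(states[k], rng)
            history[k].append(states[k])
        if it % check_every == 0:
            # Diagnose on the second half of the draws collected so far.
            recent = [h[it // 2:] for h in history]
            if gelman_rubin(recent) < threshold:
                return it, states
    return max_iter, states


def speedup(burn_in, n_samples, n_chains):
    """Idealized speedup: burn-in is not parallelizable, retained samples are."""
    sequential = burn_in + n_samples
    parallel = burn_in + n_samples / n_chains
    return sequential / parallel


if __name__ == "__main__":
    burn_in, _ = tune_burn_in()
    print("tuned burn-in length:", burn_in)
    print("speedup, fixed burn-in of 1000:", round(speedup(1000, 9000, 4), 2))
    print("speedup, burn-in of 100:", round(speedup(100, 9000, 4), 2))
```

Under this cost model, with 4 chains and 9000 retained samples, shortening the burn-in from 1000 to 100 iterations raises the idealized speedup from 10000/3250 (about 3.08) to 9100/2350 (about 3.87), which is the kind of gain the abstract attributes to a tunable burn-in.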

Classification number:
