您好,欢迎访问湖北省农业科学院 机构知识库!

Effective identification of varieties by nucleotide polymorphisms and its application for essentially derived variety identification in rice

文献类型: 外文期刊

作者: Yuan, Xiong 1 ; Li, Zirong 1 ; Xiong, Liwen 1 ; Song, Sufeng 2 ; Zheng, Xingfei 3 ; Tang, Zhonghai 4 ; Yuan, Zheming 1 ; Li, Lanzhi 1 ;

作者机构: 1.Hunan Agr Univ, Hunan Engn & Technol Res Ctr Agr Big Data Anal &, Changsha 410128, Peoples R China

2.Hunan Hybrid Rice Res Ctr, State Key Lab Hybrid Rice, Changsha 410125, Peoples R China

3.Hubei Acad Agr Sci, Food Crop Inst, Hubei Key Lab Food Crop Germplasm & Genet Improve, Wuhan 430064, Peoples R China

4.Hunan Agr Univ, Coll Food Sci & Technol, Changsha 410128, Peoples R China

关键词: Variety identification; Essentially derived variety; Rice; SNP; Whole-genome sequencing

期刊名称:BMC BIOINFORMATICS ( 影响因子:3.307; 五年影响因子:4.341 )

ISSN: 1471-2105

年卷期: 2022 年 23 卷 1 期

页码:

收录情况: SCI

摘要: Background Plant variety identification is the one most important of agricultural systems. Development of DNA marker profiles of released varieties to compare with candidate variety or future variety is required. However, strictly speaking, scientists did not use most existing variety identification techniques for "identification" but for "distinction of a limited number of cultivars," of which generalization ability always not be well estimated. Because many varieties have similar genetic backgrounds, even some essentially derived varieties (EDVs) are involved, which brings difficulties for identification and breeding progress. A fast, accurate variety identification method, which also has good performance on EDV determination, needs to be developed. Results In this study, with the strategy of "Divide and Conquer," a variety identification method Conditional Random Selection (CRS) method based on SNP of the whole genome of 3024 rice varieties was developed and be applied in essentially derived variety (EDV) identification of rice. CRS is a fast, efficient, and automated variety identification method. Meanwhile, in practical, with the optimal threshold of identity score searched in this study, the set of SNP (including 390 SNPs) showed optimal performance on EDV and non-EDV identification in two independent testing datasets. Conclusion This approach first selected a minimal set of SNPs to discriminate non-EDVs in the 3000 Rice Genome Project, then united several simplified SNP sets to improve its generalization ability for EDV and non-EDV identification in testing datasets. The results suggested that the CRS method outperformed traditional feature selection methods. Furthermore, it provides a new way to screen out core SNP loci from the whole genome for DNA fingerprinting of crop varieties and be useful for crop breeding.

  • 相关文献
作者其他论文 更多>>