您好,欢迎访问北京市农林科学院 机构知识库!

Maize yield prediction using federated random forest

文献类型: 外文期刊

作者: Zhang, Qiusi 1 ; Zhao, Xiangyu 2 ; Han, Yanyun 2 ; Yang, Feng 2 ; Pan, Shouhui 2 ; Liu, Zhongqiang 2 ; Wang, Kaiyi 1 ; Zhao, Chunjiang 1 ;

作者机构: 1.Anhui Univ, Natl Engn Res Ctr Agroecol Big Data Anal & Applica, Sch Elect & Informat Engn, Hefei, Peoples R China

2.Beijing Acad Agr & Forestry Sci, Informat Technol Res Ctr, Beijing, Peoples R China

3.Natl Engn Res Ctr Informat Technol Agr, Beijing, Peoples R China

关键词: Yield prediction; Joint breeding; Federated learning; Intelligent agriculture

期刊名称:COMPUTERS AND ELECTRONICS IN AGRICULTURE ( 影响因子:8.3; 五年影响因子:8.3 )

ISSN: 0168-1699

年卷期: 2023 年 210 卷

页码:

收录情况: SCI

摘要: Crop variety yield prediction is important for the global food supply. In recent years, machine learning models have been successfully applied in this domain. However, most seed companies typically have a limited number of trial sites they can set up and lack sufficient data for training machine learning models individually, which prevents them from benefiting from state-of-the-art AI technologies in the age of intelligent breeding. The pri-mary aim of this study is to propose a novel approach based on a federated random forest algorithm that enables breeding institutions to collaborate without disclosing or sharing their own data, and to jointly model with tabular data including field breeding test phenotypes and environmental meteorology data stored locally by each participating institution. With a focus on a real tabular dataset obtained from maize field trials at 248 trial sites from the China National Crop Variety Tests from 2017 to 2021, this paper presents the first results that delve into phenotypic data and explore federated learning algorithms based on decision trees in the field of crop yield prediction. Empirical verification of the maize crop yield prediction scenario showed that the method not only performs better than each of the models trained on an individual data source, but also is virtually lossless in accuracy compared with traditional, data-centralized random forest approach. Additionally, the method provides a cost-effective and efficient alternative for joint breeding for breeders and breeding teams.

  • 相关文献
作者其他论文 更多>>