您好,欢迎访问北京市农林科学院 机构知识库!

Maize yield prediction with trait-missing data via bipartite graph neural network

文献类型: 外文期刊

作者: Wang, Kaiyi 1 ; Han, Yanyun 1 ; Zhang, Yuqing 3 ; Zhang, Yong 3 ; Wang, Shufeng 1 ; Yang, Feng 1 ; Liu, Chunqing 4 ; Zhang, Dongfeng 1 ; Lu, Tiangang 5 ; Zhang, Like 4 ; Liu, Zhongqiang 1 ;

作者机构: 1.Beijing Acad Agr & Forestry Sci, Informat Technol Res Ctr, Beijing, Peoples R China

2.Natl Innovat Ctr Digital Seed Ind, Beijing, Peoples R China

3.Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China

4.Natl Agrotech Extens & Serv Ctr, Beijing, Peoples R China

5.Beijing Municipal Bur Agr & Rural Affairs, Beijing Digital Agr Rural Promot Ctr, Beijing, Peoples R China

关键词: yield prediction; graph neural network; bipartite graph; data imputation; gradient harmonization

期刊名称:FRONTIERS IN PLANT SCIENCE ( 影响因子:4.8; 五年影响因子:5.7 )

ISSN: 1664-462X

年卷期: 2024 年 15 卷

页码:

收录情况: SCI

摘要: The timely and accurate prediction of maize (Zea mays L.) yields prior to harvest is critical for food security and agricultural policy development. Currently, many researchers are using machine learning and deep learning to predict maize yields in specific regions with high accuracy. However, existing methods typically have two limitations. One is that they ignore the extensive correlation in maize planting data, such as the association of maize yields between adjacent planting locations and the combined effect of meteorological features and maize traits on maize yields. The other issue is that the performance of existing models may suffer significantly when some data in maize planting records is missing, or the samples are unbalanced. Therefore, this paper proposes an end-to-end bipartite graph neural network-based model for trait data imputation and yield prediction. The maize planting data is initially converted to a bipartite graph data structure. Then, a yield prediction model based on a bipartite graph neural network is developed to impute missing trait data and predict maize yield. This model can mine correlations between different samples of data, correlations between different meteorological features and traits, and correlations between different traits. Finally, to address the issue of unbalanced sample size at each planting location, we propose a loss function based on the gradient balancing mechanism that effectively reduces the impact of data imbalance on the prediction model. When compared to other data imputation and prediction models, our method achieves the best yield prediction result even when missing data is not pre-processed.

  • 相关文献
作者其他论文 更多>>