Identifying Tomato Growth Stages in Protected Agriculture with StyleGAN3-Synthetic Images and Vision Transformer

文献类型: 外文期刊

第一作者: Huo, Yao

作者: Huo, Yao;Liu, Yongbo;He, Peng;Hu, Liang;Gao, Wenbo;Gu, Le

作者机构:

关键词: StyleGAN3; ViT; deep learning; tomato

期刊名称:AGRICULTURE-BASEL ( 影响因子:3.6; 五年影响因子:3.8 )

ISSN:

年卷期: 2025 年 15 卷 2 期

页码:

收录情况: SCI

摘要: In protected agriculture, accurately identifying the key growth stages of tomatoes plays a significant role in achieving efficient management and high-precision production. However, traditional approaches often face challenges like non-standardized data collection, unbalanced datasets, low recognition efficiency, and limited accuracy. This paper proposes an innovative solution combining generative adversarial networks (GANs) and deep learning techniques to address these challenges. Specifically, the StyleGAN3 model is employed to generate high-quality images of tomato growth stages, effectively augmenting the original dataset with a broader range of images. This augmented dataset is then processed using a Vision Transformer (ViT) model for intelligent recognition of tomato growth stages within a protected agricultural environment. The proposed method was tested on 2723 images, demonstrating that the generated images are nearly indistinguishable from real images. The combined training approach incorporating both generated and original images produced superior recognition results compared to training with only the original images. The validation set achieved an accuracy of 99.6%, while the test set achieved 98.39%, marking improvements of 22.85%, 3.57%, and 3.21% over AlexNet, DenseNet50, and VGG16, respectively. The average detection speed was 9.5 ms. This method provides a highly effective means of identifying tomato growth stages in protected environments and offers valuable insights for improving the efficiency and quality of protected crop production.

分类号:

  • 相关文献
作者其他论文 更多>>