Monocular Depth Estimation with Self-Supervised Learning for Vineyard Unmanned Agricultural Vehicle
文献类型: 外文期刊
第一作者: Cui, Xue-Zhi
作者: Cui, Xue-Zhi;Feng, Quan;Wang, Shu-Zhi;Zhang, Jian-Hua
作者机构:
关键词: edge computing device; monocular depth estimation; self-supervised learning; vineyard scene
期刊名称:SENSORS ( 影响因子:3.847; 五年影响因子:4.05 )
ISSN:
年卷期: 2022 年 22 卷 3 期
页码:
收录情况: SCI
摘要: To find an economical solution to infer the depth of the surrounding environment of unmanned agricultural vehicles (UAV), a lightweight depth estimation model called MonoDA based on a convolutional neural network is proposed. A series of sequential frames from monocular videos are used to train the model. The model is composed of two subnetworks-the depth estimation subnetwork and the pose estimation subnetwork. The former is a modified version of U-Net that reduces the number of bridges, while the latter takes EfficientNet-B0 as its backbone network to extract the features of sequential frames and predict the pose transformation relations between the frames. The self-supervised strategy is adopted during the training, which means the depth information labels of frames are not needed. Instead, the adjacent frames in the image sequence and the reprojection relation of the pose are used to train the model. Subnetworks' outputs (depth map and pose relation) are used to reconstruct the input frame, then a self-supervised loss between the reconstructed input and the original input is calculated. Finally, the loss is employed to update the parameters of the two subnetworks through the backward pass. Several experiments are conducted to evaluate the model's performance, and the results show that MonoDA has competitive accuracy over the KITTI raw dataset as well as our vineyard dataset. Besides, our method also possessed the advantage of non-sensitivity to color. On the computing platform of our UAV's environment perceptual system NVIDIA JETSON TX2, the model could run at 18.92 FPS. To sum up, our approach provides an economical solution for depth estimation by using monocular cameras, which achieves a good trade-off between accuracy and speed and can be used as a novel auxiliary depth detection paradigm for UAVs.
分类号:
- 相关文献
作者其他论文 更多>>
-
EMSAM: enhanced multi-scale segment anything model for leaf disease segmentation
作者:Li, Junlong;Feng, Quan;Yang, Sen;Zhang, Jianhua;Zhang, Jianhua
关键词:segment anything model; parameter efficient fine-tuning; adapter tuning; leaf disease segmentation; multi-task learning
-
Adaptive Tracking and Cutting Control System for Tea Canopy: Design and Experimental Evaluation
作者:Zhang, Danzhu;Chen, Liping;Feng, Quan;Zhang, Ruirui;Chen, Liping;Zhang, Linhuan;Yi, Tongchuan
关键词:tea harvesting; tea picker; height tracking; roll angle adaptation
-
Potato Plant Variety Identification Study Based on Improved Swin Transformer
作者:Xing, Xue;Liu, Chengzhong;Han, Junying;Ma, Baixiong;Feng, Quan;Qi, Enfang;Qu, Yaying
关键词:deep learning; potato; Swin Transformer; convolutional neural network; variety identification
-
Few-shot crop disease recognition using sequence- weighted ensemble model-agnostic meta-learning
作者:Li, Junlong;Feng, Quan;Yang, Junqi;Yang, Sen;Zhang, Jianhua;Zhang, Jianhua
关键词:crop disease recognition; few-shot learning; meta-learning; ensemble learning; sequence-weighted ensemble
-
Few-shot disease recognition algorithm based on supervised contrastive learning
作者:Mu, Jiawei;Feng, Quan;Yang, Junqi;Yang, Sen;Zhang, Jianhua;Zhang, Jianhua
关键词:plant disease recognition; few-shot learning; meta-learning; supervised contrastive learning; nearest-centroid classification
-
An effective unsupervised domain adaptation for in-field potato disease recognition
作者:Gao, Xueze;Feng, Quan;Yang, Sen;Wang, Shuzhi;Zhang, Jianhua;Zhang, Jianhua
关键词:Field environment; Domain shift; Subdomain; Multi-representation; Light variation; Disease progression
-
From laboratory to field: cross-domain few-shot learning for crop disease identification in the field
作者:Yang, Sen;Feng, Quan;Yang, Wanxia;Zhou, Wenwei;Yan, Wenbo;Zhang, Jianhua
关键词:cross-domain; few-shot learning; crop diseases; recognition; multi-domain