A light-weight stereo matching network based on multi-scale features fusion and robust disparity refinement
文献类型: 外文期刊
作者: Yang, Xiaowei 1 ; Zhao, Yong 3 ; Feng, Zhiguo 1 ; Sang, Haiwei 4 ; Zhang, Zhenbo 1 ; Zhang, Guiying 5 ; He, Lin 1 ;
作者机构: 1.Guizhou Univ, Sch Mech Engn, Guiyang, Peoples R China
2.Guizhou Acad Agr Sci, Guizhou Tea Res Inst, Guiyang, Peoples R China
3.Peking Univ, Sch Elect & Comp Engn, Shenzhen Grad Sch, Shenzhen, Peoples R China
4.Guizhou Educ Univ, Sch Math & Big Data, Guiyang, Peoples R China
5.Guangzhou Med Univ, Qingyuan Peoples Hosp, Affiliated Hosp 6, Sch Biomed Engn, Guangzhou, Peoples R China
6.Liupanshui Normal Coll, Sch Min & Civil Engn, Liupanshui, Peoples R China
7.Guizhou Univ, Sch Mech Engn, Guiyang 550025, Peoples R China
关键词: computer vision; image processing; stereo image processing
期刊名称:IET IMAGE PROCESSING ( 影响因子:2.3; 五年影响因子:2.3 )
ISSN: 1751-9659
年卷期: 2023 年 17 卷 6 期
页码:
收录情况: SCI
摘要: In recent years, convolutional-neural-network based stereo matching methods have achieved significant gains compared to conventional methods in terms of both speed and accuracy. Current state-of-the-art disparity estimation algorithms require many parameters and large amounts of computational resources and are not suited for applications on edge devices. In this paper, an end-to-end light-weight network (LWNet) for fast stereo matching is proposed, which consists of an efficient backbone with multi-scale feature fusion for feature extraction, a 3D U-Net aggregation architecture for disparity computation, and color guidance in a 2D convolutional neural network (CNN) for disparity refinement. MobileNetV2 is adopted as an efficient backbone in feature extraction. The channel attention module is applied to improve the representational capacity of features and multi-resolution information is adaptively incorporated into the cost volume via cross-scale connections. Further, a left-right consistency check and color guidance refinement are introduced and a robust disparity refinement network is designed with skip connections and dilated convolutions to capture global context information and improve disparity estimation accuracy with little computational cost and memory space. Extensive experiments on Scene Flow, KITTI 2015, and KITTI 2012 demonstrate that the proposed LWNet achieves competitive accuracy and speed when compared with state-of-the-art stereo matching methods.
- 相关文献
作者其他论文 更多>>
-
The mechanism of coumarin inhibits germination of ryegrass (Lolium perenne) and its application as coumarin-carbon dots nanocomposites
作者:Yang, Ni;Ran, Lulu;Yang, Furong;Shen, Guangmao;Zhang, Ping;He, Lin;Qian, Kun;He, Xiulong;Ma, Chuanxin;Chen, Hanqiu;Xiang, Dong
关键词:coumarin; ryegrass; germination; hormone pathway; carbon dots
-
Edge supervision and multi-scale cost volume for stereo matching
作者:Yang, Xiaowei;Feng, Zhiguo;He, Lin;Yang, Xiaowei;Zhao, Yong;Zhang, Guiying;He, Lin
关键词:Stereo matching; Geometric constraints; Multi-scale cost volume; Disparity refinement network
-
Multi-Attention Network for Stereo Matching
作者:Yang, Xiaowei;He, Lin;Yang, Xiaowei;He, Lin;Zhao, Yong;Yang, Zuliu;Zhao, Yong;Sang, Haiwei;Cheng, Xianjing
关键词:Feature extraction; Three-dimensional displays; Convolution; Data mining; Image edge detection; Convolutional neural networks; Neural network; stereo matching; multi-scale attention module; feature refinement module; 3D attention aggregation module