Document type: Foreign-language journal
Authors: Yang, Xiaowei 1; He, Lin 1; Zhao, Yong 4; Sang, Haiwei 5; Yang, Zuliu 4; Cheng, Xianjing 6
Author affiliations: 1.Guizhou Univ, Sch Mech Engn, Guiyang 550025, Peoples R China
2.Guizhou Acad Agr Sci, Guizhou Tea Res Inst, Guiyang 550006, Peoples R China
3.Liupanshui Normal Coll, Liupanshui 553004, Peoples R China
4.Peking Univ, Shenzhen Grad Sch, Sch Elect & Comp Engn, Shenzhen 518055, Peoples R China
5.Guizhou Educ Univ, Sch Math & Big Data, Guiyang 550018, Peoples R China
6.Zunyi Normal Univ, Res Inst Qianbei Informat Technol, Zunyi 563006, Guizhou, Peoples R China
Keywords: Feature extraction; Three-dimensional displays; Convolution; Data mining; Image edge detection; Convolutional neural networks; Neural network; stereo matching; multi-scale attention module; feature refinement module; 3D attention aggregation module
Journal: IEEE ACCESS (impact factor: 3.367; five-year impact factor: 3.671)
ISSN: 2169-3536
Year/volume/issue: 2020, vol. 8
Pages:
Indexed in: SCI
Abstract: In recent years, convolutional neural network (CNN) algorithms have driven great progress in stereo matching, but mismatches still occur in textureless, occluded and reflective regions. In feature extraction and cost aggregation, CNNs can greatly improve the accuracy of stereo matching by utilizing global context information and high-quality feature representations. In this paper, we design a novel end-to-end stereo matching algorithm named Multi-Attention Network (MAN). To obtain detailed global context information at the pixel level during feature extraction, we propose a Multi-Scale Attention Module (MSAM), which combines a spatial pyramid module with an attention mechanism. In addition, we introduce a feature refinement module (FRM) and a 3D attention aggregation module (3D AAM) during cost aggregation so that the network can extract informative features with high representational ability and high-quality channel attention vectors. Finally, we obtain the final disparity through bilinear interpolation and disparity regression. We evaluate our method on the Scene Flow, KITTI 2012 and KITTI 2015 stereo datasets. The experimental results show that our method achieves state-of-the-art performance and that every component of our network is effective.
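The abstract names disparity regression as the final step but does not spell out its form. A common formulation in end-to-end stereo networks is the soft-argmin: matching costs over candidate disparities are turned into weights with a softmax, and the output is the weighted average of the disparity indices. The sketch below illustrates that idea for a single pixel. It assumes lower cost means a better match, and the function name `soft_argmin` is hypothetical; it is not taken from the paper and may differ from what MAN actually does.

```python
import math
from typing import Sequence


def soft_argmin(costs: Sequence[float]) -> float:
    """Expected disparity for one pixel, given one matching cost per candidate disparity.

    Assumption: lower cost means a better match, so costs are negated
    before the softmax.
    """
    # Shift by the minimum cost so the largest exponent is 0 (numerical stability).
    lowest = min(costs)
    weights = [math.exp(-(c - lowest)) for c in costs]
    total = sum(weights)
    # Weighted average of the disparity indices 0, 1, ..., len(costs) - 1.
    return sum(d * w for d, w in enumerate(weights)) / total
```

Because the result is a weighted average rather than a hard argmin, it can take sub-pixel values and stays differentiable, which is why this style of regression suits end-to-end training.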
- Related literature
Other papers by the authors
-
A light-weight stereo matching network based on multi-scale features fusion and robust disparity refinement
Authors: Yang, Xiaowei; Feng, Zhiguo; Zhang, Zhenbo; He, Lin; Zhao, Yong; Sang, Haiwei; Zhang, Guiying
Keywords: computer vision; image processing; stereo image processing
-
The mechanism of coumarin inhibits germination of ryegrass (Lolium perenne) and its application as coumarin-carbon dots nanocomposites
Authors: Yang, Ni; Ran, Lulu; Yang, Furong; Shen, Guangmao; Zhang, Ping; He, Lin; Qian, Kun; He, Xiulong; Ma, Chuanxin; Chen, Hanqiu; Xiang, Dong
Keywords: coumarin; ryegrass; germination; hormone pathway; carbon dots
-
Edge supervision and multi-scale cost volume for stereo matching
Authors: Yang, Xiaowei; Feng, Zhiguo; He, Lin; Zhao, Yong; Zhang, Guiying
Keywords: Stereo matching; Geometric constraints; Multi-scale cost volume; Disparity refinement network