ASD-Conv: Monocular 3D object detection network based on Asymmetrical Segmentation Depth-aware Convolution

Yu Xingyuan; Du Neng; Gao Ge; Wen Fan

Proceedings of The 13th Asian Conference on Machine Learning, PMLR 157:642-655, 2021.

Abstract

Monocular 3D recognition is a valuable approach to 3D object recognition because it costs less than binocular or lidar-based approaches. Building on an existing monocular 3D recognition network, this paper proposes an asymmetrical segmentation depth-aware network, the ASD-Conv network, which extracts depth information from monocular images more effectively and thereby improves recognition results. Unlike other monocular recognition networks, the ASD-Conv network applies a special segmentation to the image that better captures the image's depth distribution, and it improves results on the 2D, BEV, and 3D recognition tasks. The proposed algorithm improves detection accuracy while retaining a degree of real-time performance. Experimental results show that, compared with the current model, the proposed monocular 3D object detection algorithm based on D-ASDConv achieves an average improvement of 2.82% (AP) in large object detection and a highest average improvement of 2.01% (AP) in small object detection on the KITTI dataset. The algorithm effectively learns more advanced spatially aware features and yields more accurate detection results on monocular images.
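
The abstract does not spell out how the image is segmented, so the following is only a rough illustration of the general idea of a row-binned, depth-aware convolution: the feature map is cut into horizontal bands of unequal height, and each band is filtered with its own kernel. The function names, the band boundaries, and the tensor sizes are all hypothetical choices made for this sketch; it is not the authors' implementation.

```python
import numpy as np

def conv2d_same(x, w):
    """Naive zero-padded 'same' 2D cross-correlation.

    x: (C_in, H, W) feature map; w: (C_out, C_in, k, k) kernel with odd k.
    """
    c_out, c_in, k, _ = w.shape
    p = k // 2
    xp = np.pad(x, ((0, 0), (p, p), (p, p)))
    _, h, wd = x.shape
    out = np.zeros((c_out, h, wd))
    for i in range(k):
        for j in range(k):
            patch = xp[:, i:i + h, j:j + wd]  # input shifted by kernel offset (i, j)
            out += np.einsum("oc,chw->ohw", w[:, :, i, j], patch)
    return out

def row_binned_conv(x, kernels, boundaries):
    """Filter each horizontal band of x with its own kernel.

    boundaries: increasing row indices from 0 to H; band b covers rows
    boundaries[b]:boundaries[b + 1]. Unequal gaps give asymmetric bands.
    Assumption: bands are defined on output rows, so each band still sees
    neighbouring rows as context at its edges.
    """
    assert len(kernels) == len(boundaries) - 1
    assert boundaries[0] == 0 and boundaries[-1] == x.shape[1]
    out = np.zeros((kernels[0].shape[0], x.shape[1], x.shape[2]))
    for w, top, bottom in zip(kernels, boundaries[:-1], boundaries[1:]):
        # Simple but wasteful: run the full convolution, keep only this band.
        out[:, top:bottom] = conv2d_same(x, w)[:, top:bottom]
    return out

# Hypothetical sizes: 3 input channels, 4 output channels, 12x8 feature map.
rng = np.random.default_rng(0)
x = rng.standard_normal((3, 12, 8))
kernels = [rng.standard_normal((4, 3, 3, 3)) for _ in range(3)]
boundaries = [0, 2, 5, 12]  # unequal band heights: 2, 3, and 7 rows
out = row_binned_conv(x, kernels, boundaries)
print(out.shape)
```

Because image rows in a driving scene correlate loosely with distance from the camera, giving each band its own kernel lets the filters specialise by approximate depth; a real network would learn both the kernels and, possibly, the band layout.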

Cite this Paper

BibTeX

@InProceedings{pmlr-v157-xingyuan21a,
  title = 	 {ASD-Conv: Monocular 3D object detection network based on Asymmetrical Segmentation Depth-aware Convolution},
  author =       {Xingyuan, Yu and Neng, Du and Ge, Gao and Fan, Wen},
  booktitle = 	 {Proceedings of The 13th Asian Conference on Machine Learning},
  pages = 	 {642--655},
  year = 	 {2021},
  editor = 	 {Balasubramanian, Vineeth N. and Tsang, Ivor},
  volume = 	 {157},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {17--19 Nov},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v157/xingyuan21a/xingyuan21a.pdf},
  url = 	 {https://proceedings.mlr.press/v157/xingyuan21a.html},
  abstract = 	 {In the field of 3D object recognition, monocular 3D recognition technology is a valuable recognition technology. Compared with binocular technology and lidar technology, its cost is lower. In this paper, based on the existing monocular 3D recognition network, we propose an asymmetrical segmentation depth-aware network: ASD-Conv Network, which is used to better obtain the depth information of monocular images, so as to obtain better recognition results. Compared with other monocular recognition networks, ASD-Conv network performs special segmentation on the image, which can better obtain the depth distribution of the image, and has made a good breakthrough and improvement in the image recognition tasks of 2D, BEV and 3D. The improved algorithm proposed in this paper can improve the detection accuracy while maintaining a certain real-time performance. Experimental results show that compared with the current model, the proposed monocular 3D object detection algorithm based on D-ASDConv has an average improvement rate of 2.82%(AP) in large object detection and the highest average improvement rate of 2.01%(AP) in small object detection on Kitti dataset. The algorithm can effectively learn more advanced features of spatial perception, and the detection results of monocular images are more accurate.}
}

Endnote

%0 Conference Paper
%T ASD-Conv: Monocular 3D object detection network based on Asymmetrical Segmentation Depth-aware Convolution
%A Yu Xingyuan
%A Du Neng
%A Gao Ge
%A Wen Fan
%B Proceedings of The 13th Asian Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2021
%E Vineeth N. Balasubramanian
%E Ivor Tsang
%F pmlr-v157-xingyuan21a
%I PMLR
%P 642--655
%U https://proceedings.mlr.press/v157/xingyuan21a.html
%V 157
%X In the field of 3D object recognition, monocular 3D recognition technology is a valuable recognition technology. Compared with binocular technology and lidar technology, its cost is lower. In this paper, based on the existing monocular 3D recognition network, we propose an asymmetrical segmentation depth-aware network: ASD-Conv Network, which is used to better obtain the depth information of monocular images, so as to obtain better recognition results. Compared with other monocular recognition networks, ASD-Conv network performs special segmentation on the image, which can better obtain the depth distribution of the image, and has made a good breakthrough and improvement in the image recognition tasks of 2D, BEV and 3D. The improved algorithm proposed in this paper can improve the detection accuracy while maintaining a certain real-time performance. Experimental results show that compared with the current model, the proposed monocular 3D object detection algorithm based on D-ASDConv has an average improvement rate of 2.82%(AP) in large object detection and the highest average improvement rate of 2.01%(AP) in small object detection on Kitti dataset. The algorithm can effectively learn more advanced features of spatial perception, and the detection results of monocular images are more accurate.

APA

Xingyuan, Y., Neng, D., Ge, G. & Fan, W. (2021). ASD-Conv: Monocular 3D object detection network based on Asymmetrical Segmentation Depth-aware Convolution. Proceedings of The 13th Asian Conference on Machine Learning, in Proceedings of Machine Learning Research 157:642-655. Available from https://proceedings.mlr.press/v157/xingyuan21a.html.
