Learning Road Scene-level Representations via Semantic Region Prediction

Zihao Xiao; Alan Yuille; Yi-Ting Chen

Learning Road Scene-level Representations via Semantic Region Prediction

Zihao Xiao, Alan Yuille, Yi-Ting Chen

Proceedings of The 6th Conference on Robot Learning, PMLR 205:1882-1892, 2023.

Abstract

In this work, we tackle two vital tasks in automated driving systems, i.e., driver intent prediction and risk object identification from egocentric images. Mainly, we investigate the question: what would be good road scene-level representations for these two tasks? We contend that a scene-level representation must capture higher-level semantic and geometric representations of traffic scenes around ego-vehicle while performing actions to their destinations. To this end, we introduce the representation of semantic regions, which are areas where ego-vehicles visit while taking an afforded action (e.g., left-turn at 4-way intersections). We propose to learn scene-level representations via a novel semantic region prediction task and an automatic semantic region labeling algorithm. Extensive evaluations are conducted on the HDD and nuScenes datasets, and the learned representations lead to state-of-the-art performance for driver intention prediction and risk object identification.

Cite this Paper

BibTeX


@InProceedings{pmlr-v205-xiao23b,
  title = 	 {Learning Road Scene-level Representations via Semantic Region Prediction},
  author =       {Xiao, Zihao and Yuille, Alan and Chen, Yi-Ting},
  booktitle = 	 {Proceedings of The 6th Conference on Robot Learning},
  pages = 	 {1882--1892},
  year = 	 {2023},
  editor = 	 {Liu, Karen and Kulic, Dana and Ichnowski, Jeff},
  volume = 	 {205},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {14--18 Dec},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v205/xiao23b/xiao23b.pdf},
  url = 	 {https://proceedings.mlr.press/v205/xiao23b.html},
  abstract = 	 {In this work, we tackle two vital tasks in automated driving systems, i.e., driver intent prediction and risk object identification from egocentric images. Mainly, we investigate the question: what would be good road scene-level representations for these two tasks? We contend that a scene-level representation must capture higher-level semantic and geometric representations of traffic scenes around ego-vehicle while performing actions to their destinations. To this end, we introduce the representation of semantic regions, which are areas where ego-vehicles visit while taking an afforded action (e.g., left-turn at 4-way intersections). We propose to learn scene-level representations via a novel semantic region prediction task and an automatic semantic region labeling algorithm. Extensive evaluations are conducted on the HDD and nuScenes datasets, and the learned representations lead to state-of-the-art performance for driver intention prediction and risk object identification. }
}

Endnote

%0 Conference Paper
%T Learning Road Scene-level Representations via Semantic Region Prediction
%A Zihao Xiao
%A Alan Yuille
%A Yi-Ting Chen
%B Proceedings of The 6th Conference on Robot Learning
%C Proceedings of Machine Learning Research
%D 2023
%E Karen Liu
%E Dana Kulic
%E Jeff Ichnowski	
%F pmlr-v205-xiao23b
%I PMLR
%P 1882--1892
%U https://proceedings.mlr.press/v205/xiao23b.html
%V 205
%X In this work, we tackle two vital tasks in automated driving systems, i.e., driver intent prediction and risk object identification from egocentric images. Mainly, we investigate the question: what would be good road scene-level representations for these two tasks? We contend that a scene-level representation must capture higher-level semantic and geometric representations of traffic scenes around ego-vehicle while performing actions to their destinations. To this end, we introduce the representation of semantic regions, which are areas where ego-vehicles visit while taking an afforded action (e.g., left-turn at 4-way intersections). We propose to learn scene-level representations via a novel semantic region prediction task and an automatic semantic region labeling algorithm. Extensive evaluations are conducted on the HDD and nuScenes datasets, and the learned representations lead to state-of-the-art performance for driver intention prediction and risk object identification.

APA


Xiao, Z., Yuille, A. & Chen, Y.. (2023). Learning Road Scene-level Representations via Semantic Region Prediction. Proceedings of The 6th Conference on Robot Learning, in Proceedings of Machine Learning Research 205:1882-1892 Available from https://proceedings.mlr.press/v205/xiao23b.html.

Learning Road Scene-level Representations via Semantic Region Prediction

Abstract

Cite this Paper

Related Material