Test-Time Adaptation for Online Vision-Language Navigation with Feedback-based Reinforcement Learning

Sungjune Kim; Gyeongrok Oh; Heeju Ko; Daehyun Ji; Dongwook Lee; Byung-Jun Lee; Sujin Jang; Sangpil Kim

Test-Time Adaptation for Online Vision-Language Navigation with Feedback-based Reinforcement Learning

Sungjune Kim, Gyeongrok Oh, Heeju Ko, Daehyun Ji, Dongwook Lee, Byung-Jun Lee, Sujin Jang, Sangpil Kim

Proceedings of the 42nd International Conference on Machine Learning, PMLR 267:30654-30671, 2025.

Abstract

Navigating in an unfamiliar environment during deployment poses a critical challenge for a vision-language navigation (VLN) agent. Yet, test-time adaptation (TTA) remains relatively underexplored in robotic navigation, leading us to the fundamental question: what are the key properties of TTA for online VLN? In our view, effective adaptation requires three qualities: 1) flexibility in handling different navigation outcomes, 2) interactivity with external environment, and 3) maintaining a harmony between plasticity and stability. To address this, we introduce FeedTTA, a novel TTA framework for online VLN utilizing feedback-based reinforcement learning. Specifically, FeedTTA learns by maximizing binary episodic feedback, a practical setup in which the agent receives a binary scalar after each episode that indicates the success or failure of the navigation. Additionally, we propose a gradient regularization technique that leverages the binary structure of FeedTTA to achieve a balance between plasticity and stability during adaptation. Our extensive experiments on challenging VLN benchmarks demonstrate the superior adaptability of FeedTTA, even outperforming the state-of-the-art offline training methods in REVERIE benchmark with a single stream of learning.

Cite this Paper

BibTeX

@InProceedings{pmlr-v267-kim25ad,
  title = 	 {Test-Time Adaptation for Online Vision-Language Navigation with Feedback-based Reinforcement Learning},
  author =       {Kim, Sungjune and Oh, Gyeongrok and Ko, Heeju and Ji, Daehyun and Lee, Dongwook and Lee, Byung-Jun and Jang, Sujin and Kim, Sangpil},
  booktitle = 	 {Proceedings of the 42nd International Conference on Machine Learning},
  pages = 	 {30654--30671},
  year = 	 {2025},
  editor = 	 {Singh, Aarti and Fazel, Maryam and Hsu, Daniel and Lacoste-Julien, Simon and Berkenkamp, Felix and Maharaj, Tegan and Wagstaff, Kiri and Zhu, Jerry},
  volume = 	 {267},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {13--19 Jul},
  publisher =    {PMLR},
  pdf = 	 {https://raw.githubusercontent.com/mlresearch/v267/main/assets/kim25ad/kim25ad.pdf},
  url = 	 {https://proceedings.mlr.press/v267/kim25ad.html},
  abstract = 	 {Navigating in an unfamiliar environment during deployment poses a critical challenge for a vision-language navigation (VLN) agent. Yet, test-time adaptation (TTA) remains relatively underexplored in robotic navigation, leading us to the fundamental question: what are the key properties of TTA for online VLN? In our view, effective adaptation requires three qualities: 1) flexibility in handling different navigation outcomes, 2) interactivity with external environment, and 3) maintaining a harmony between plasticity and stability. To address this, we introduce FeedTTA, a novel TTA framework for online VLN utilizing feedback-based reinforcement learning. Specifically, FeedTTA learns by maximizing binary episodic feedback, a practical setup in which the agent receives a binary scalar after each episode that indicates the success or failure of the navigation. Additionally, we propose a gradient regularization technique that leverages the binary structure of FeedTTA to achieve a balance between plasticity and stability during adaptation. Our extensive experiments on challenging VLN benchmarks demonstrate the superior adaptability of FeedTTA, even outperforming the state-of-the-art offline training methods in REVERIE benchmark with a single stream of learning.}
}

Endnote

%0 Conference Paper
%T Test-Time Adaptation for Online Vision-Language Navigation with Feedback-based Reinforcement Learning
%A Sungjune Kim
%A Gyeongrok Oh
%A Heeju Ko
%A Daehyun Ji
%A Dongwook Lee
%A Byung-Jun Lee
%A Sujin Jang
%A Sangpil Kim
%B Proceedings of the 42nd International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2025
%E Aarti Singh
%E Maryam Fazel
%E Daniel Hsu
%E Simon Lacoste-Julien
%E Felix Berkenkamp
%E Tegan Maharaj
%E Kiri Wagstaff
%E Jerry Zhu	
%F pmlr-v267-kim25ad
%I PMLR
%P 30654--30671
%U https://proceedings.mlr.press/v267/kim25ad.html
%V 267
%X Navigating in an unfamiliar environment during deployment poses a critical challenge for a vision-language navigation (VLN) agent. Yet, test-time adaptation (TTA) remains relatively underexplored in robotic navigation, leading us to the fundamental question: what are the key properties of TTA for online VLN? In our view, effective adaptation requires three qualities: 1) flexibility in handling different navigation outcomes, 2) interactivity with external environment, and 3) maintaining a harmony between plasticity and stability. To address this, we introduce FeedTTA, a novel TTA framework for online VLN utilizing feedback-based reinforcement learning. Specifically, FeedTTA learns by maximizing binary episodic feedback, a practical setup in which the agent receives a binary scalar after each episode that indicates the success or failure of the navigation. Additionally, we propose a gradient regularization technique that leverages the binary structure of FeedTTA to achieve a balance between plasticity and stability during adaptation. Our extensive experiments on challenging VLN benchmarks demonstrate the superior adaptability of FeedTTA, even outperforming the state-of-the-art offline training methods in REVERIE benchmark with a single stream of learning.

APA

Kim, S., Oh, G., Ko, H., Ji, D., Lee, D., Lee, B., Jang, S. & Kim, S.. (2025). Test-Time Adaptation for Online Vision-Language Navigation with Feedback-based Reinforcement Learning. Proceedings of the 42nd International Conference on Machine Learning, in Proceedings of Machine Learning Research 267:30654-30671 Available from https://proceedings.mlr.press/v267/kim25ad.html.

Test-Time Adaptation for Online Vision-Language Navigation with Feedback-based Reinforcement Learning

Abstract

Cite this Paper

Related Material