NBDI: A Simple and Effective Termination Condition for Skill Extraction from Task-Agnostic Demonstrations

Myunsoo Kim, Hayeong Lee, Seong-Woong Shim, Junho Seo, Byung-Jun Lee
Proceedings of the 42nd International Conference on Machine Learning, PMLR 267:30437-30461, 2025.

Abstract

Intelligent agents are able to make decisions based on different levels of granularity and duration. Recent advances in skill learning enabled the agent to solve complex, long-horizon tasks by effectively guiding the agent in choosing appropriate skills. However, the practice of using fixed-length skills can easily result in skipping valuable decision points, which ultimately limits the potential for further exploration and faster policy learning. In this work, we propose to learn a simple and effective termination condition that identifies decision points through a state-action novelty module that leverages agent experience data. Our approach, Novelty-based Decision Point Identification (NBDI), outperforms previous baselines in complex, long-horizon tasks, and remains effective even in the presence of significant variations in the environment configurations of downstream tasks, highlighting the importance of decision point identification in skill learning.

Cite this Paper


BibTeX
@InProceedings{pmlr-v267-kim25v, title = {{NBDI}: A Simple and Effective Termination Condition for Skill Extraction from Task-Agnostic Demonstrations}, author = {Kim, Myunsoo and Lee, Hayeong and Shim, Seong-Woong and Seo, Junho and Lee, Byung-Jun}, booktitle = {Proceedings of the 42nd International Conference on Machine Learning}, pages = {30437--30461}, year = {2025}, editor = {Singh, Aarti and Fazel, Maryam and Hsu, Daniel and Lacoste-Julien, Simon and Berkenkamp, Felix and Maharaj, Tegan and Wagstaff, Kiri and Zhu, Jerry}, volume = {267}, series = {Proceedings of Machine Learning Research}, month = {13--19 Jul}, publisher = {PMLR}, pdf = {https://raw.githubusercontent.com/mlresearch/v267/main/assets/kim25v/kim25v.pdf}, url = {https://proceedings.mlr.press/v267/kim25v.html}, abstract = {Intelligent agents are able to make decisions based on different levels of granularity and duration. Recent advances in skill learning enabled the agent to solve complex, long-horizon tasks by effectively guiding the agent in choosing appropriate skills. However, the practice of using fixed-length skills can easily result in skipping valuable decision points, which ultimately limits the potential for further exploration and faster policy learning. In this work, we propose to learn a simple and effective termination condition that identifies decision points through a state-action novelty module that leverages agent experience data. Our approach, Novelty-based Decision Point Identification (NBDI), outperforms previous baselines in complex, long-horizon tasks, and remains effective even in the presence of significant variations in the environment configurations of downstream tasks, highlighting the importance of decision point identification in skill learning.} }
Endnote
%0 Conference Paper %T NBDI: A Simple and Effective Termination Condition for Skill Extraction from Task-Agnostic Demonstrations %A Myunsoo Kim %A Hayeong Lee %A Seong-Woong Shim %A Junho Seo %A Byung-Jun Lee %B Proceedings of the 42nd International Conference on Machine Learning %C Proceedings of Machine Learning Research %D 2025 %E Aarti Singh %E Maryam Fazel %E Daniel Hsu %E Simon Lacoste-Julien %E Felix Berkenkamp %E Tegan Maharaj %E Kiri Wagstaff %E Jerry Zhu %F pmlr-v267-kim25v %I PMLR %P 30437--30461 %U https://proceedings.mlr.press/v267/kim25v.html %V 267 %X Intelligent agents are able to make decisions based on different levels of granularity and duration. Recent advances in skill learning enabled the agent to solve complex, long-horizon tasks by effectively guiding the agent in choosing appropriate skills. However, the practice of using fixed-length skills can easily result in skipping valuable decision points, which ultimately limits the potential for further exploration and faster policy learning. In this work, we propose to learn a simple and effective termination condition that identifies decision points through a state-action novelty module that leverages agent experience data. Our approach, Novelty-based Decision Point Identification (NBDI), outperforms previous baselines in complex, long-horizon tasks, and remains effective even in the presence of significant variations in the environment configurations of downstream tasks, highlighting the importance of decision point identification in skill learning.
APA
Kim, M., Lee, H., Shim, S., Seo, J. & Lee, B.. (2025). NBDI: A Simple and Effective Termination Condition for Skill Extraction from Task-Agnostic Demonstrations. Proceedings of the 42nd International Conference on Machine Learning, in Proceedings of Machine Learning Research 267:30437-30461 Available from https://proceedings.mlr.press/v267/kim25v.html.

Related Material