TAG: Task-based Accumulated Gradients for Lifelong learning

Pranshu Malviya; Balaraman Ravindran; Sarath Chandar

TAG: Task-based Accumulated Gradients for Lifelong learning

Pranshu Malviya, Balaraman Ravindran, Sarath Chandar

Proceedings of The 1st Conference on Lifelong Learning Agents, PMLR 199:366-389, 2022.

Abstract

When an agent encounters a continual stream of new tasks in the lifelong learning setting, it leverages the knowledge it gained from the earlier tasks to help learn the new tasks better. In such a scenario, identifying an efficient knowledge representation becomes a challenging problem. Most research works propose to either store a subset of examples from the past tasks in a replay buffer, dedicate a separate set of parameters to each task or penalize excessive updates over parameters by introducing a regularization term. While existing methods employ the general task-agnostic stochastic gradient descent update rule, we propose a task-aware optimizer that adapts the learning rate based on the relatedness among tasks. We utilize the directions taken by the parameters during the updates by additively accumulating the gradients specific to each task. These task-based accumulated gradients act as a knowledge base that is maintained and updated throughout the stream. We empirically show that our proposed adaptive learning rate not only accounts for catastrophic forgetting but also exhibits knowledge transfer. We also show that our method performs better than several state-of-the-art methods in lifelong learning on complex datasets. Moreover, our method can also be combined with the existing methods and achieve substantial improvement in performance.

Cite this Paper

BibTeX


@InProceedings{pmlr-v199-malviya22a,
  title = 	 {TAG: Task-based Accumulated Gradients for Lifelong learning},
  author =       {Malviya, Pranshu and Ravindran, Balaraman and Chandar, Sarath},
  booktitle = 	 {Proceedings of The 1st Conference on Lifelong Learning Agents},
  pages = 	 {366--389},
  year = 	 {2022},
  editor = 	 {Chandar, Sarath and Pascanu, Razvan and Precup, Doina},
  volume = 	 {199},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {22--24 Aug},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v199/malviya22a/malviya22a.pdf},
  url = 	 {https://proceedings.mlr.press/v199/malviya22a.html},
  abstract = 	 {When an agent encounters a continual stream of new tasks in the lifelong learning setting, it leverages the knowledge it gained from the earlier tasks to help learn the new tasks better. In such a scenario, identifying an efficient knowledge representation becomes a challenging problem. Most research works propose to either store a subset of examples from the past tasks in a replay buffer, dedicate a separate set of parameters to each task or penalize excessive updates over parameters by introducing a regularization term. While existing methods employ the general task-agnostic stochastic gradient descent update rule, we propose a task-aware optimizer that adapts the learning rate based on the relatedness among tasks. We utilize the directions taken by the parameters during the updates by additively accumulating the gradients specific to each task. These task-based accumulated gradients act as a knowledge base that is maintained and updated throughout the stream. We empirically show that our proposed adaptive learning rate not only accounts for catastrophic forgetting but also exhibits knowledge transfer. We also show that our method performs better than several state-of-the-art methods in lifelong learning on complex datasets. Moreover, our method can also be combined with the existing methods and achieve substantial improvement in performance.}
}

Endnote

%0 Conference Paper
%T TAG: Task-based Accumulated Gradients for Lifelong learning
%A Pranshu Malviya
%A Balaraman Ravindran
%A Sarath Chandar
%B Proceedings of The 1st Conference on Lifelong Learning Agents
%C Proceedings of Machine Learning Research
%D 2022
%E Sarath Chandar
%E Razvan Pascanu
%E Doina Precup	
%F pmlr-v199-malviya22a
%I PMLR
%P 366--389
%U https://proceedings.mlr.press/v199/malviya22a.html
%V 199
%X When an agent encounters a continual stream of new tasks in the lifelong learning setting, it leverages the knowledge it gained from the earlier tasks to help learn the new tasks better. In such a scenario, identifying an efficient knowledge representation becomes a challenging problem. Most research works propose to either store a subset of examples from the past tasks in a replay buffer, dedicate a separate set of parameters to each task or penalize excessive updates over parameters by introducing a regularization term. While existing methods employ the general task-agnostic stochastic gradient descent update rule, we propose a task-aware optimizer that adapts the learning rate based on the relatedness among tasks. We utilize the directions taken by the parameters during the updates by additively accumulating the gradients specific to each task. These task-based accumulated gradients act as a knowledge base that is maintained and updated throughout the stream. We empirically show that our proposed adaptive learning rate not only accounts for catastrophic forgetting but also exhibits knowledge transfer. We also show that our method performs better than several state-of-the-art methods in lifelong learning on complex datasets. Moreover, our method can also be combined with the existing methods and achieve substantial improvement in performance.

APA


Malviya, P., Ravindran, B. & Chandar, S.. (2022). TAG: Task-based Accumulated Gradients for Lifelong learning. Proceedings of The 1st Conference on Lifelong Learning Agents, in Proceedings of Machine Learning Research 199:366-389 Available from https://proceedings.mlr.press/v199/malviya22a.html.

TAG: Task-based Accumulated Gradients for Lifelong learning

Abstract

Cite this Paper

Related Material