Task Agnostic Representation Consolidation: a Self-supervised based Continual Learning Approach
Proceedings of The 1st Conference on Lifelong Learning Agents, PMLR 199:390-405, 2022.
Abstract
Continual learning (CL) over non-stationary data streams remains one of the long-standing challenges in deep neural networks (DNNs) as they are prone to catastrophic forgetting. CL models can benefit from self-supervised pre-training as it enables learning more generalizable task-agnostic features. However, the effect of self-supervised pre-training diminishes as the length of the task sequence increases. Furthermore, the domain shift between the pre-training data distribution and the task distribution reduces the generalizability of the learned representations. To address these limitations, we propose Task Agnostic Representation Consolidation (TARC), a novel two-stage training paradigm for CL that intertwines task-agnostic and task-specific learning, whereby self-supervised training is followed by supervised learning for each task. To further restrict deviation from the representations learned in the self-supervised stage, we employ a task-agnostic auxiliary loss during the supervised stage. We show that our training paradigm can be easily added to memory- or regularization-based approaches and provides consistent performance gains across more challenging CL settings. We further show that it leads to more robust and well-calibrated models.
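The following is a minimal sketch of the two-stage, per-task training loop described in the abstract: a self-supervised stage followed by a supervised stage with a task-agnostic auxiliary term. The concrete components (the negative-cosine self-supervised loss, the `projector`, `augment`, `aux_weight`, and the toy modules in the usage example) are assumptions for illustration, not the paper's exact implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


def ssl_loss(z1, z2):
    """Generic negative-cosine self-supervised objective (assumed, not the paper's exact loss)."""
    z1 = F.normalize(z1, dim=1)
    z2 = F.normalize(z2, dim=1)
    return -(z1 * z2).sum(dim=1).mean()


def train_task(encoder, projector, classifier, task_loader, augment,
               ssl_epochs=1, sup_epochs=1, aux_weight=0.1, lr=1e-3):
    """Two-stage training for a single task in the sequence (illustrative sketch)."""
    params = list(encoder.parameters()) + list(projector.parameters()) + list(classifier.parameters())
    opt = torch.optim.SGD(params, lr=lr)

    # Stage 1: task-agnostic (self-supervised) training on the current task's data.
    for _ in range(ssl_epochs):
        for x, _ in task_loader:
            z1 = projector(encoder(augment(x)))
            z2 = projector(encoder(augment(x)))
            loss = ssl_loss(z1, z2)
            opt.zero_grad()
            loss.backward()
            opt.step()

    # Stage 2: task-specific (supervised) training, with a task-agnostic auxiliary
    # loss that discourages drift away from the self-supervised representations.
    for _ in range(sup_epochs):
        for x, y in task_loader:
            ce = F.cross_entropy(classifier(encoder(x)), y)
            z1 = projector(encoder(augment(x)))
            z2 = projector(encoder(augment(x)))
            loss = ce + aux_weight * ssl_loss(z1, z2)
            opt.zero_grad()
            loss.backward()
            opt.step()


if __name__ == "__main__":
    # Toy usage with random data and stand-in modules (illustrative only).
    enc = nn.Sequential(nn.Linear(32, 64), nn.ReLU())
    proj = nn.Linear(64, 64)
    clf = nn.Linear(64, 10)
    xs, ys = torch.randn(128, 32), torch.randint(0, 10, (128,))
    loader = torch.utils.data.DataLoader(
        torch.utils.data.TensorDataset(xs, ys), batch_size=32)
    noise_aug = lambda x: x + 0.1 * torch.randn_like(x)  # stand-in augmentation
    train_task(enc, proj, clf, loader, noise_aug)
```

In a full CL setup this loop would be run once per task in the stream and combined with a memory- or regularization-based method, as the abstract notes; those components are omitted here for brevity.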