Dynamic Forward and Backward Sparse Training (DFBST): Accelerated Deep Learning through Completely Sparse Training Schedule
Proceedings of The 14th Asian Conference on Machine
Learning, PMLR 189:848-863, 2023.
Abstract
Neural network sparsification has received considerable
attention in recent years. A number of dynamic
sparse training methods have been developed that
achieve significant sparsity levels during training
while maintaining performance comparable to their dense
counterparts. However, most of these methods update
all the model parameters using dense gradients; where
gradient sparsification is attempted, it relies either
on a non-dynamic (fixed) schedule or on a computationally
expensive dynamic pruning schedule. To alleviate
these drawbacks, we propose Dynamic Forward and
Backward Sparse Training (DFBST), an algorithm that
dynamically sparsifies both the forward and backward
passes using trainable masks, leading to a
completely sparse training schedule. In contrast to
existing sparse training methods, we propose to learn
the forward and backward masks separately. Our approach
achieves state-of-the-art performance in terms of both
accuracy and sparsity
compared to existing dynamic pruning algorithms on
benchmark datasets, namely MNIST, CIFAR-10 and
CIFAR-100.
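
As a rough illustration of the core idea, the PyTorch sketch below defines a linear layer whose weights are gated by a trainable forward mask and whose weight gradients are gated by a second, independently parameterized backward mask. Everything here is an illustrative assumption: the class names, the score parameterization, the fixed threshold, and the straight-through binarization are not taken from the paper, and the sketch does not reproduce DFBST's actual mask-learning rules.

# Illustrative sketch only: a trainable forward mask plus a gradient-gating
# backward mask on a single linear layer. Names and the straight-through
# binarization are assumptions; this is not the DFBST algorithm itself.
import torch
import torch.nn as nn
import torch.nn.functional as F


class BinarizeSTE(torch.autograd.Function):
    """Threshold real-valued scores to a {0,1} mask; straight-through gradient."""

    @staticmethod
    def forward(ctx, scores, threshold):
        return (scores > threshold).float()

    @staticmethod
    def backward(ctx, grad_output):
        # Pass the gradient straight through to the underlying scores.
        return grad_output, None


class GateGradient(torch.autograd.Function):
    """Identity in the forward pass; multiplies the gradient by a mask in backward."""

    @staticmethod
    def forward(ctx, weight, backward_mask):
        ctx.save_for_backward(backward_mask)
        return weight.view_as(weight)

    @staticmethod
    def backward(ctx, grad_output):
        (backward_mask,) = ctx.saved_tensors
        # The backward mask sparsifies the weight gradient. In this sketch the
        # mask itself receives no gradient here; the paper learns it separately.
        return grad_output * backward_mask, None


class DoublyMaskedLinear(nn.Module):
    """Linear layer with independent forward and backward sparsity masks."""

    def __init__(self, in_features, out_features, threshold=0.5):
        super().__init__()
        self.weight = nn.Parameter(0.01 * torch.randn(out_features, in_features))
        # Separate score tensors parameterize the two masks; roughly half the
        # entries start active, and training moves scores across the threshold.
        self.fwd_scores = nn.Parameter(torch.rand(out_features, in_features))
        self.bwd_scores = nn.Parameter(torch.rand(out_features, in_features))
        self.threshold = threshold

    def forward(self, x):
        fwd_mask = BinarizeSTE.apply(self.fwd_scores, self.threshold)
        bwd_mask = BinarizeSTE.apply(self.bwd_scores, self.threshold)
        # Forward pass uses masked weights; backward pass sees masked gradients.
        w = GateGradient.apply(self.weight, bwd_mask) * fwd_mask
        return F.linear(x, w)


if __name__ == "__main__":
    layer = DoublyMaskedLinear(8, 4)
    out = layer(torch.randn(2, 8))
    out.sum().backward()  # layer.weight.grad is gated by the backward mask

In a full DFBST-style setup the backward-mask scores would also be trained (the paper proposes learning the forward and backward masks separately); the sketch only shows how two independent masks can sparsify the forward computation and the weight gradients.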