Federated Learning with Uncertainty via Distilled Predictive Distributions

Shrey Bhatt; Aishwarya Gupta; Piyush Rai

Federated Learning with Uncertainty via Distilled Predictive Distributions

Shrey Bhatt, Aishwarya Gupta, Piyush Rai

Proceedings of the 15th Asian Conference on Machine Learning, PMLR 222:153-168, 2024.

Abstract

Most existing federated learning methods are unable to estimate model/predictive uncertainty since the client models are trained using the standard loss function minimization approach which ignores such uncertainties. In many situations, however, especially in limited data settings, it is beneficial to take into account the uncertainty in the model parameters at each client as it leads to more accurate predictions and also because reliable estimates of uncertainty can be used for tasks, such as out-of-distribution (OOD) detection, and sequential decision-making tasks, such as active learning. We present a framework for federated learning with uncertainty where, in each round, each client infers the posterior distribution over its parameters as well as the posterior predictive distribution (PPD), distills the PPD into a single deep neural network, and sends this network to the server. Unlike some of the recent Bayesian approaches to federated learning, our approach does not require sending the whole posterior distribution of the parameters from each client to the server but only the PPD in the distilled form as a deep neural network. In addition, when making predictions at test time, it does not require computationally expensive Monte-Carlo averaging over the posterior distribution because our approach always maintains the PPD in form a single deep neural network. Moreover, our approach does not make any restrictive assumptions, such as the form of the clients’ posterior distributions, or of their PPDs. We evaluate our approach on classification in federated setting, as well as active learning and OOD detection in federated settings, on which our approach outperforms various existing federated learning baselines.

Cite this Paper

BibTeX


@InProceedings{pmlr-v222-bhatt24a,
  title = 	 {Federated Learning with Uncertainty via Distilled Predictive Distributions},
  author =       {Bhatt, Shrey and Gupta, Aishwarya and Rai, Piyush},
  booktitle = 	 {Proceedings of the 15th Asian Conference on Machine Learning},
  pages = 	 {153--168},
  year = 	 {2024},
  editor = 	 {Yanıkoğlu, Berrin and Buntine, Wray},
  volume = 	 {222},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {11--14 Nov},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v222/bhatt24a/bhatt24a.pdf},
  url = 	 {https://proceedings.mlr.press/v222/bhatt24a.html},
  abstract = 	 {Most existing federated learning methods are unable to estimate model/predictive uncertainty since the client models are trained using the standard loss function minimization approach which ignores such uncertainties. In many situations, however, especially in limited data settings, it is beneficial to take into account the uncertainty in the model parameters at each client as it leads to more accurate predictions and also because reliable estimates of uncertainty can be used for tasks, such as out-of-distribution (OOD) detection, and sequential decision-making tasks, such as active learning. We present a framework for federated learning with uncertainty where, in each round, each client infers the posterior distribution over its parameters as well as the posterior predictive distribution (PPD), distills the PPD into a single deep neural network, and sends this network to the server. Unlike some of the recent Bayesian approaches to federated learning, our approach does not require sending the whole posterior distribution of the parameters from each client to the server but only the PPD in the distilled form as a deep neural network. In addition, when making predictions at test time, it does not require computationally expensive Monte-Carlo averaging over the posterior distribution because our approach always maintains the PPD in form a single deep neural network. Moreover, our approach does not make any restrictive assumptions, such as the form of the clients’ posterior distributions, or of their PPDs. We evaluate our approach on classification in federated setting, as well as active learning and OOD detection in federated settings, on which our approach outperforms various existing federated learning baselines.}
}

Endnote

%0 Conference Paper
%T Federated Learning with Uncertainty via Distilled Predictive Distributions
%A Shrey Bhatt
%A Aishwarya Gupta
%A Piyush Rai
%B Proceedings of the 15th Asian Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2024
%E Berrin Yanıkoğlu
%E Wray Buntine	
%F pmlr-v222-bhatt24a
%I PMLR
%P 153--168
%U https://proceedings.mlr.press/v222/bhatt24a.html
%V 222
%X Most existing federated learning methods are unable to estimate model/predictive uncertainty since the client models are trained using the standard loss function minimization approach which ignores such uncertainties. In many situations, however, especially in limited data settings, it is beneficial to take into account the uncertainty in the model parameters at each client as it leads to more accurate predictions and also because reliable estimates of uncertainty can be used for tasks, such as out-of-distribution (OOD) detection, and sequential decision-making tasks, such as active learning. We present a framework for federated learning with uncertainty where, in each round, each client infers the posterior distribution over its parameters as well as the posterior predictive distribution (PPD), distills the PPD into a single deep neural network, and sends this network to the server. Unlike some of the recent Bayesian approaches to federated learning, our approach does not require sending the whole posterior distribution of the parameters from each client to the server but only the PPD in the distilled form as a deep neural network. In addition, when making predictions at test time, it does not require computationally expensive Monte-Carlo averaging over the posterior distribution because our approach always maintains the PPD in form a single deep neural network. Moreover, our approach does not make any restrictive assumptions, such as the form of the clients’ posterior distributions, or of their PPDs. We evaluate our approach on classification in federated setting, as well as active learning and OOD detection in federated settings, on which our approach outperforms various existing federated learning baselines.

APA


Bhatt, S., Gupta, A. & Rai, P.. (2024). Federated Learning with Uncertainty via Distilled Predictive Distributions. Proceedings of the 15th Asian Conference on Machine Learning, in Proceedings of Machine Learning Research 222:153-168 Available from https://proceedings.mlr.press/v222/bhatt24a.html.

Federated Learning with Uncertainty via Distilled Predictive Distributions

Abstract

Cite this Paper

Related Material