Multi-Objective Reference-Aligned Machine Unlearning

Rasa Khosrowshahli, Stephen Asobiela, Beatrice Ombuki-Berman, Shahryrar Rahnamayan
Proceedings of the The 39th Canadian Conference on Artificial Intelligence, PMLR 318:833-839, 2026.

Abstract

Machine unlearning aims to remove the influence of specific training samples while preserving the model’s utility. Existing single-objective approaches, such as gradient ascent or random relabeling, often induce catastrophic forgetting due to conflicting optimization dynamics and unbounded forgetting objectives that cause the model to drift from its pre-trained knowledge. We propose Reference-Aligned UnLearning (RAUL), a multi-objective framework that jointly optimizes forgetting and retention by replacing unbounded loss maximization with a bounded KL alignment of predictions on forgotten samples toward a reference distribution representing unseen data, instantiated either as a uniform distribution or an empirical distribution from a held-out reference set, which constrains the forgetting objective and reduces gradient conflict with retention. The resulting multi-objective optimization (MOO) problem is solved via Jacobian descent, which aggregates multiple gradients into a direction that does not conflict. Our results demonstrate that RAUL achieves the closest gap compared to full retraining.

Cite this Paper


BibTeX
@InProceedings{pmlr-v318-khosrowshahli26a, title = {Multi-Objective Reference-Aligned Machine Unlearning}, author = {Khosrowshahli, Rasa and Asobiela, Stephen and Ombuki-Berman, Beatrice and Rahnamayan, Shahryrar}, booktitle = {Proceedings of the The 39th Canadian Conference on Artificial Intelligence}, pages = {833--839}, year = {2026}, editor = {Bouzar-Benlabiod, Lydia and Leung, Carson}, volume = {318}, series = {Proceedings of Machine Learning Research}, month = {25--29 May}, publisher = {PMLR}, pdf = {https://raw.githubusercontent.com/mlresearch/v318/main/assets/khosrowshahli26a/khosrowshahli26a.pdf}, url = {https://proceedings.mlr.press/v318/khosrowshahli26a.html}, abstract = {Machine unlearning aims to remove the influence of specific training samples while preserving the model’s utility. Existing single-objective approaches, such as gradient ascent or random relabeling, often induce catastrophic forgetting due to conflicting optimization dynamics and unbounded forgetting objectives that cause the model to drift from its pre-trained knowledge. We propose Reference-Aligned UnLearning (RAUL), a multi-objective framework that jointly optimizes forgetting and retention by replacing unbounded loss maximization with a bounded KL alignment of predictions on forgotten samples toward a reference distribution representing unseen data, instantiated either as a uniform distribution or an empirical distribution from a held-out reference set, which constrains the forgetting objective and reduces gradient conflict with retention. The resulting multi-objective optimization (MOO) problem is solved via Jacobian descent, which aggregates multiple gradients into a direction that does not conflict. Our results demonstrate that RAUL achieves the closest gap compared to full retraining.} }
Endnote
%0 Conference Paper %T Multi-Objective Reference-Aligned Machine Unlearning %A Rasa Khosrowshahli %A Stephen Asobiela %A Beatrice Ombuki-Berman %A Shahryrar Rahnamayan %B Proceedings of the The 39th Canadian Conference on Artificial Intelligence %C Proceedings of Machine Learning Research %D 2026 %E Lydia Bouzar-Benlabiod %E Carson Leung %F pmlr-v318-khosrowshahli26a %I PMLR %P 833--839 %U https://proceedings.mlr.press/v318/khosrowshahli26a.html %V 318 %X Machine unlearning aims to remove the influence of specific training samples while preserving the model’s utility. Existing single-objective approaches, such as gradient ascent or random relabeling, often induce catastrophic forgetting due to conflicting optimization dynamics and unbounded forgetting objectives that cause the model to drift from its pre-trained knowledge. We propose Reference-Aligned UnLearning (RAUL), a multi-objective framework that jointly optimizes forgetting and retention by replacing unbounded loss maximization with a bounded KL alignment of predictions on forgotten samples toward a reference distribution representing unseen data, instantiated either as a uniform distribution or an empirical distribution from a held-out reference set, which constrains the forgetting objective and reduces gradient conflict with retention. The resulting multi-objective optimization (MOO) problem is solved via Jacobian descent, which aggregates multiple gradients into a direction that does not conflict. Our results demonstrate that RAUL achieves the closest gap compared to full retraining.
APA
Khosrowshahli, R., Asobiela, S., Ombuki-Berman, B. & Rahnamayan, S.. (2026). Multi-Objective Reference-Aligned Machine Unlearning. Proceedings of the The 39th Canadian Conference on Artificial Intelligence, in Proceedings of Machine Learning Research 318:833-839 Available from https://proceedings.mlr.press/v318/khosrowshahli26a.html.

Related Material