An Interactive Visual Demo of Bias Mitigation Techniques for Word Representations From a Geometric Perspective

Archit Rathore, Sunipa Dev, Vivek Srikumar, Jeff M Phillips, Yan Zheng, Michael Yeh, Junpeng Wang, Wei Zhang, Bei Wang
Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track, PMLR 176:330-335, 2022.

Abstract

Language representations are known to encode and propagate biases, i.e., stereotypical associations between words or groups of words that may cause representational harm. In this demo, we utilize interactive visualization to increase the interpretability of a number of state-of-the-art techniques that are designed to identify, mitigate, and attenuate these biases in word representations, in particular, from a geometric perspective. We provide an open source web-based visualization tool and offer hands-on experience in exploring the effects of these debiasing techniques on the geometry of high-dimensional word vectors. To help understand how various debiasing techniques change the underlying geometry, we decompose each technique into modular and interpretable sequences of primitive operations, and study their effect on the word vectors using dimensionality reduction and interactive visual exploration. This demo is primarily designed to aid natural language processing (NLP) practitioners and researchers working with fairness and ethics of machine learning systems. It can also be used to educate NLP novices in understanding the existence of and then mitigating biases in word embeddings.

Cite this Paper


BibTeX
@InProceedings{pmlr-v176-rathore22a, title = {An Interactive Visual Demo of Bias Mitigation Techniques for Word Representations From a Geometric Perspective}, author = {Rathore, Archit and Dev, Sunipa and Srikumar, Vivek and Phillips, Jeff M and Zheng, Yan and Yeh, Michael and Wang, Junpeng and Zhang, Wei and Wang, Bei}, booktitle = {Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track}, pages = {330--335}, year = {2022}, editor = {Kiela, Douwe and Ciccone, Marco and Caputo, Barbara}, volume = {176}, series = {Proceedings of Machine Learning Research}, month = {06--14 Dec}, publisher = {PMLR}, pdf = {https://proceedings.mlr.press/v176/rathore22a/rathore22a.pdf}, url = {https://proceedings.mlr.press/v176/rathore22a.html}, abstract = {Language representations are known to encode and propagate biases, i.e., stereotypical associations between words or groups of words that may cause representational harm. In this demo, we utilize interactive visualization to increase the interpretability of a number of state-of-the-art techniques that are designed to identify, mitigate, and attenuate these biases in word representations, in particular, from a geometric perspective. We provide an open source web-based visualization tool and offer hands-on experience in exploring the effects of these debiasing techniques on the geometry of high-dimensional word vectors. To help understand how various debiasing techniques change the underlying geometry, we decompose each technique into modular and interpretable sequences of primitive operations, and study their effect on the word vectors using dimensionality reduction and interactive visual exploration. This demo is primarily designed to aid natural language processing (NLP) practitioners and researchers working with fairness and ethics of machine learning systems. It can also be used to educate NLP novices in understanding the existence of and then mitigating biases in word embeddings. } }
Endnote
%0 Conference Paper %T An Interactive Visual Demo of Bias Mitigation Techniques for Word Representations From a Geometric Perspective %A Archit Rathore %A Sunipa Dev %A Vivek Srikumar %A Jeff M Phillips %A Yan Zheng %A Michael Yeh %A Junpeng Wang %A Wei Zhang %A Bei Wang %B Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track %C Proceedings of Machine Learning Research %D 2022 %E Douwe Kiela %E Marco Ciccone %E Barbara Caputo %F pmlr-v176-rathore22a %I PMLR %P 330--335 %U https://proceedings.mlr.press/v176/rathore22a.html %V 176 %X Language representations are known to encode and propagate biases, i.e., stereotypical associations between words or groups of words that may cause representational harm. In this demo, we utilize interactive visualization to increase the interpretability of a number of state-of-the-art techniques that are designed to identify, mitigate, and attenuate these biases in word representations, in particular, from a geometric perspective. We provide an open source web-based visualization tool and offer hands-on experience in exploring the effects of these debiasing techniques on the geometry of high-dimensional word vectors. To help understand how various debiasing techniques change the underlying geometry, we decompose each technique into modular and interpretable sequences of primitive operations, and study their effect on the word vectors using dimensionality reduction and interactive visual exploration. This demo is primarily designed to aid natural language processing (NLP) practitioners and researchers working with fairness and ethics of machine learning systems. It can also be used to educate NLP novices in understanding the existence of and then mitigating biases in word embeddings.
APA
Rathore, A., Dev, S., Srikumar, V., Phillips, J.M., Zheng, Y., Yeh, M., Wang, J., Zhang, W. & Wang, B.. (2022). An Interactive Visual Demo of Bias Mitigation Techniques for Word Representations From a Geometric Perspective. Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track, in Proceedings of Machine Learning Research 176:330-335 Available from https://proceedings.mlr.press/v176/rathore22a.html.

Related Material