The Role of Codeword-to-Class Assignments in Error-Correcting Codes: An Empirical Study

Itay Evron, Ophir Onn, Tamar Weiss, Hai Azeroual, Daniel Soudry
Proceedings of The 26th International Conference on Artificial Intelligence and Statistics, PMLR 206:8053-8077, 2023.

Abstract

Error-correcting codes (ECC) are used to reduce multiclass classification tasks to multiple binary classification subproblems. In ECC, classes are represented by the rows of a binary matrix, corresponding to codewords in a codebook. Codebooks are commonly either predefined or problem dependent. Given predefined codebooks, codeword-to-class assignments are traditionally overlooked, and codewords are implicitly assigned to classes arbitrarily. Our paper shows that these assignments play a major role in the performance of ECC. Specifically, we examine similarity-preserving assignments, where similar codewords are assigned to similar classes. Addressing a controversy in existing literature, our extensive experiments confirm that similarity-preserving assignments induce easier subproblems and are superior to other assignment policies in terms of their generalization performance. We find that similarity-preserving assignments make predefined codebooks become problem-dependent, without altering other favorable codebook properties. Finally, we show that our findings can improve predefined codebooks dedicated to extreme classification.

Cite this Paper


BibTeX
@InProceedings{pmlr-v206-evron23a, title = {The Role of Codeword-to-Class Assignments in Error-Correcting Codes: An Empirical Study}, author = {Evron, Itay and Onn, Ophir and Weiss, Tamar and Azeroual, Hai and Soudry, Daniel}, booktitle = {Proceedings of The 26th International Conference on Artificial Intelligence and Statistics}, pages = {8053--8077}, year = {2023}, editor = {Ruiz, Francisco and Dy, Jennifer and van de Meent, Jan-Willem}, volume = {206}, series = {Proceedings of Machine Learning Research}, month = {25--27 Apr}, publisher = {PMLR}, pdf = {https://proceedings.mlr.press/v206/evron23a/evron23a.pdf}, url = {https://proceedings.mlr.press/v206/evron23a.html}, abstract = {Error-correcting codes (ECC) are used to reduce multiclass classification tasks to multiple binary classification subproblems. In ECC, classes are represented by the rows of a binary matrix, corresponding to codewords in a codebook. Codebooks are commonly either predefined or problem dependent. Given predefined codebooks, codeword-to-class assignments are traditionally overlooked, and codewords are implicitly assigned to classes arbitrarily. Our paper shows that these assignments play a major role in the performance of ECC. Specifically, we examine similarity-preserving assignments, where similar codewords are assigned to similar classes. Addressing a controversy in existing literature, our extensive experiments confirm that similarity-preserving assignments induce easier subproblems and are superior to other assignment policies in terms of their generalization performance. We find that similarity-preserving assignments make predefined codebooks become problem-dependent, without altering other favorable codebook properties. Finally, we show that our findings can improve predefined codebooks dedicated to extreme classification.} }
Endnote
%0 Conference Paper %T The Role of Codeword-to-Class Assignments in Error-Correcting Codes: An Empirical Study %A Itay Evron %A Ophir Onn %A Tamar Weiss %A Hai Azeroual %A Daniel Soudry %B Proceedings of The 26th International Conference on Artificial Intelligence and Statistics %C Proceedings of Machine Learning Research %D 2023 %E Francisco Ruiz %E Jennifer Dy %E Jan-Willem van de Meent %F pmlr-v206-evron23a %I PMLR %P 8053--8077 %U https://proceedings.mlr.press/v206/evron23a.html %V 206 %X Error-correcting codes (ECC) are used to reduce multiclass classification tasks to multiple binary classification subproblems. In ECC, classes are represented by the rows of a binary matrix, corresponding to codewords in a codebook. Codebooks are commonly either predefined or problem dependent. Given predefined codebooks, codeword-to-class assignments are traditionally overlooked, and codewords are implicitly assigned to classes arbitrarily. Our paper shows that these assignments play a major role in the performance of ECC. Specifically, we examine similarity-preserving assignments, where similar codewords are assigned to similar classes. Addressing a controversy in existing literature, our extensive experiments confirm that similarity-preserving assignments induce easier subproblems and are superior to other assignment policies in terms of their generalization performance. We find that similarity-preserving assignments make predefined codebooks become problem-dependent, without altering other favorable codebook properties. Finally, we show that our findings can improve predefined codebooks dedicated to extreme classification.
APA
Evron, I., Onn, O., Weiss, T., Azeroual, H. & Soudry, D.. (2023). The Role of Codeword-to-Class Assignments in Error-Correcting Codes: An Empirical Study. Proceedings of The 26th International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 206:8053-8077 Available from https://proceedings.mlr.press/v206/evron23a.html.

Related Material