Feature screening with kernel knockoffs

Benjamin Poignard, Peter J. Naylor, Héctor Climente-González, Makoto Yamada
Proceedings of The 25th International Conference on Artificial Intelligence and Statistics, PMLR 151:1935-1974, 2022.

Abstract

This article analyses three feature screening procedures: Kendall’s Tau and Spearman Rho (TR), Hilbert-Schmidt Independence Criterion (HSIC) and conditional Maximum Mean Discrepancy (cMMD), where the latter is a modified version of the standard MMD for categorical classification. These association measures are not based on any specific underlying model, such as the linear regression. We provide the conditions for which the sure independence screening (SIS) property is satisfied under a lower bound assumption on the minimum signal strength of the association measure. The SIS property for the HSIC and cMMD is established for given bounded and symmetric kernels. Within the high-dimensional setting, we propose a two-step approach to control the false discovery rate (FDR) using the knockoff filtering. The performances of the association measures are assessed through simulated and real data experiments and compared with existing competing screening methods.

Cite this Paper


BibTeX
@InProceedings{pmlr-v151-poignard22a, title = { Feature screening with kernel knockoffs }, author = {Poignard, Benjamin and Naylor, Peter J. and Climente-Gonz\'alez, H\'ector and Yamada, Makoto}, booktitle = {Proceedings of The 25th International Conference on Artificial Intelligence and Statistics}, pages = {1935--1974}, year = {2022}, editor = {Camps-Valls, Gustau and Ruiz, Francisco J. R. and Valera, Isabel}, volume = {151}, series = {Proceedings of Machine Learning Research}, month = {28--30 Mar}, publisher = {PMLR}, pdf = {https://proceedings.mlr.press/v151/poignard22a/poignard22a.pdf}, url = {https://proceedings.mlr.press/v151/poignard22a.html}, abstract = { This article analyses three feature screening procedures: Kendall’s Tau and Spearman Rho (TR), Hilbert-Schmidt Independence Criterion (HSIC) and conditional Maximum Mean Discrepancy (cMMD), where the latter is a modified version of the standard MMD for categorical classification. These association measures are not based on any specific underlying model, such as the linear regression. We provide the conditions for which the sure independence screening (SIS) property is satisfied under a lower bound assumption on the minimum signal strength of the association measure. The SIS property for the HSIC and cMMD is established for given bounded and symmetric kernels. Within the high-dimensional setting, we propose a two-step approach to control the false discovery rate (FDR) using the knockoff filtering. The performances of the association measures are assessed through simulated and real data experiments and compared with existing competing screening methods. } }
Endnote
%0 Conference Paper %T Feature screening with kernel knockoffs %A Benjamin Poignard %A Peter J. Naylor %A Héctor Climente-González %A Makoto Yamada %B Proceedings of The 25th International Conference on Artificial Intelligence and Statistics %C Proceedings of Machine Learning Research %D 2022 %E Gustau Camps-Valls %E Francisco J. R. Ruiz %E Isabel Valera %F pmlr-v151-poignard22a %I PMLR %P 1935--1974 %U https://proceedings.mlr.press/v151/poignard22a.html %V 151 %X This article analyses three feature screening procedures: Kendall’s Tau and Spearman Rho (TR), Hilbert-Schmidt Independence Criterion (HSIC) and conditional Maximum Mean Discrepancy (cMMD), where the latter is a modified version of the standard MMD for categorical classification. These association measures are not based on any specific underlying model, such as the linear regression. We provide the conditions for which the sure independence screening (SIS) property is satisfied under a lower bound assumption on the minimum signal strength of the association measure. The SIS property for the HSIC and cMMD is established for given bounded and symmetric kernels. Within the high-dimensional setting, we propose a two-step approach to control the false discovery rate (FDR) using the knockoff filtering. The performances of the association measures are assessed through simulated and real data experiments and compared with existing competing screening methods.
APA
Poignard, B., Naylor, P.J., Climente-González, H. & Yamada, M.. (2022). Feature screening with kernel knockoffs . Proceedings of The 25th International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 151:1935-1974 Available from https://proceedings.mlr.press/v151/poignard22a.html.

Related Material