Non-parametric Inference Adaptive to Intrinsic Dimension

Khashayar Khosravi, Greg Lewis, Vasilis Syrgkanis
Proceedings of the First Conference on Causal Learning and Reasoning, PMLR 177:373-389, 2022.

Abstract

We consider non-parametric estimation and inference of conditional moment models in high dimensions. We show that even when the dimension D of the conditioning variable is larger than the sample size n, estimation and inference is feasible as long as the distribution of the conditioning variable has small intrinsic dimension d, as measured by locally low doubling measures. Our estimation is based on a sub-sampled ensemble of the k-nearest neighbors (k-NN) Z-estimator. We show that if the intrinsic dimension of the covariate distribution is equal to d, then the finite sample estimation error of our estimator is of order n1/(d+2) and our estimate is n1/(d+2)-asymptotically normal, irrespective of D. The sub-sampling size required for achieving these results depends on the unknown intrinsic dimension d. We propose an adaptive data-driven approach for choosing this parameter and prove that it achieves the desired rates. We discuss extensions and applications to heterogeneous treatment effect estimation.

Cite this Paper


BibTeX
@InProceedings{pmlr-v177-khosravi22a, title = {Non-parametric Inference Adaptive to Intrinsic Dimension}, author = {Khosravi, Khashayar and Lewis, Greg and Syrgkanis, Vasilis}, booktitle = {Proceedings of the First Conference on Causal Learning and Reasoning}, pages = {373--389}, year = {2022}, editor = {Schölkopf, Bernhard and Uhler, Caroline and Zhang, Kun}, volume = {177}, series = {Proceedings of Machine Learning Research}, month = {11--13 Apr}, publisher = {PMLR}, pdf = {https://proceedings.mlr.press/v177/khosravi22a/khosravi22a.pdf}, url = {https://proceedings.mlr.press/v177/khosravi22a.html}, abstract = {We consider non-parametric estimation and inference of conditional moment models in high dimensions. We show that even when the dimension $D$ of the conditioning variable is larger than the sample size $n$, estimation and inference is feasible as long as the distribution of the conditioning variable has small intrinsic dimension $d$, as measured by locally low doubling measures. Our estimation is based on a sub-sampled ensemble of the $k$-nearest neighbors ($k$-NN) $Z$-estimator. We show that if the intrinsic dimension of the covariate distribution is equal to $d$, then the finite sample estimation error of our estimator is of order $n^{-1/(d+2)}$ and our estimate is $n^{1/(d+2)}$-asymptotically normal, irrespective of $D$. The sub-sampling size required for achieving these results depends on the unknown intrinsic dimension $d$. We propose an adaptive data-driven approach for choosing this parameter and prove that it achieves the desired rates. We discuss extensions and applications to heterogeneous treatment effect estimation.} }
Endnote
%0 Conference Paper %T Non-parametric Inference Adaptive to Intrinsic Dimension %A Khashayar Khosravi %A Greg Lewis %A Vasilis Syrgkanis %B Proceedings of the First Conference on Causal Learning and Reasoning %C Proceedings of Machine Learning Research %D 2022 %E Bernhard Schölkopf %E Caroline Uhler %E Kun Zhang %F pmlr-v177-khosravi22a %I PMLR %P 373--389 %U https://proceedings.mlr.press/v177/khosravi22a.html %V 177 %X We consider non-parametric estimation and inference of conditional moment models in high dimensions. We show that even when the dimension $D$ of the conditioning variable is larger than the sample size $n$, estimation and inference is feasible as long as the distribution of the conditioning variable has small intrinsic dimension $d$, as measured by locally low doubling measures. Our estimation is based on a sub-sampled ensemble of the $k$-nearest neighbors ($k$-NN) $Z$-estimator. We show that if the intrinsic dimension of the covariate distribution is equal to $d$, then the finite sample estimation error of our estimator is of order $n^{-1/(d+2)}$ and our estimate is $n^{1/(d+2)}$-asymptotically normal, irrespective of $D$. The sub-sampling size required for achieving these results depends on the unknown intrinsic dimension $d$. We propose an adaptive data-driven approach for choosing this parameter and prove that it achieves the desired rates. We discuss extensions and applications to heterogeneous treatment effect estimation.
APA
Khosravi, K., Lewis, G. & Syrgkanis, V.. (2022). Non-parametric Inference Adaptive to Intrinsic Dimension. Proceedings of the First Conference on Causal Learning and Reasoning, in Proceedings of Machine Learning Research 177:373-389 Available from https://proceedings.mlr.press/v177/khosravi22a.html.

Related Material