[edit]
Conformal prediction in manifold learning
Proceedings of the Seventh Workshop on Conformal and Probabilistic Prediction and Applications, PMLR 91:234-253, 2018.
Abstract
The paper presents a geometrically motivated view on conformal prediction applied to nonlinear multi-output regression tasks for obtaining valid measure of accuracy of Manifold Learning Regression algorithms. A considered regression task is to estimate an unknown smooth mapping \mathbff from q-dimensional inputs \mathbfx∈\mathbfX to m-dimensional outputs \mathbfy=\mathbff(\mathbfx) based on training dataset \mathbfZ(n) consisting of “input-output” pairs Zi=(\mathbfxi,\mathbfyi=\mathbff(\mathbfxi))\textrmT,i=1,2,...,n. Manifold Learning Regression (MLR) algorithm solves this task using Manifold learning technique. At first, unknown q-dimensional Regression manifold \mathbfM(\mathbff)=(\mathbfx,\mathbff(\mathbfx))\textrmT∈\mathrmRq+m:\mathbfx∈\mathbfX⊂\mathrmRq, embedded in ambient (q+m)-dimensional space, is estimated from the training data \mathbfZ(n), sampled from this manifold. The constructed estimator \mathbfM\textMLR, which is also q-dimensional manifold embedded in ambient space \textrmRq+m, is close to \mathbfM in terms of Hausdorff distance. After that, an estimator \mathbff\textMLR of the unknown function \mathbff, mapping arbitrary input \mathbfx∈\mathbfX to output \mathbff\textrmMLR(\mathbfx), is constructed as the solution to the equation \mathbfM(\mathbff\textrmMLR)=\mathbfM\textMLR. Conformal prediction allows constructing a prediction region for an unknown output \mathbfy=\mathbff(\mathbfx) at Out-of-Sample input point \mathbfx for a given confidence level using given nonconformity measure, characterizing to which extent an example Z=(\mathbfx,\mathbfy)\textrmT is different from examples in the known dataset \mathbfZ(n). The paper proposes a new nonconformity measure based on MLR estimator using an analog of Bregman distance.