Anomaly Detection in Extreme Regions via Empirical MV-sets on the Sphere

Albert Thomas, Stéphan Clemencon, Alexandre Gramfort, Anne Sabourin
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, PMLR 54:1011-1019, 2017.

Abstract

Extreme regions in the feature space are of particular concern for anomaly detection: anomalies are likely to be located in the tails, whereas data scarcity in such regions makes it difficult to distinguish between large normal instances and anomalies. This paper presents an unsupervised algorithm for anomaly detection in extreme regions. We propose a Minimum Volume set (MV-set) approach relying on multivariate extreme value theory. This framework includes a canonical pre-processing step, which addresses the issue of output sensitivity to standardization choices. The resulting data representation on the sphere highlights the dependence structure of the extremal observations. Anomaly detection is then cast as a MV-set estimation problem on the sphere, where volume is measured by the spherical measure and mass refers to the angular measure. An anomaly then corresponds to an unusual observation given that one of its variables is large. A preliminary rate bound analysis is carried out for the learning method we introduce and its computational advantages are discussed and illustrated by numerical experiments.

Cite this Paper


BibTeX
@InProceedings{pmlr-v54-thomas17a, title = {{Anomaly Detection in Extreme Regions via Empirical MV-sets on the Sphere}}, author = {Thomas, Albert and Clemencon, Stéphan and Gramfort, Alexandre and Sabourin, Anne}, booktitle = {Proceedings of the 20th International Conference on Artificial Intelligence and Statistics}, pages = {1011--1019}, year = {2017}, editor = {Singh, Aarti and Zhu, Jerry}, volume = {54}, series = {Proceedings of Machine Learning Research}, month = {20--22 Apr}, publisher = {PMLR}, pdf = {http://proceedings.mlr.press/v54/thomas17a/thomas17a.pdf}, url = {https://proceedings.mlr.press/v54/thomas17a.html}, abstract = {Extreme regions in the feature space are of particular concern for anomaly detection: anomalies are likely to be located in the tails, whereas data scarcity in such regions makes it difficult to distinguish between large normal instances and anomalies. This paper presents an unsupervised algorithm for anomaly detection in extreme regions. We propose a Minimum Volume set (MV-set) approach relying on multivariate extreme value theory. This framework includes a canonical pre-processing step, which addresses the issue of output sensitivity to standardization choices. The resulting data representation on the sphere highlights the dependence structure of the extremal observations. Anomaly detection is then cast as a MV-set estimation problem on the sphere, where volume is measured by the spherical measure and mass refers to the angular measure. An anomaly then corresponds to an unusual observation given that one of its variables is large. A preliminary rate bound analysis is carried out for the learning method we introduce and its computational advantages are discussed and illustrated by numerical experiments.} }
Endnote
%0 Conference Paper %T Anomaly Detection in Extreme Regions via Empirical MV-sets on the Sphere %A Albert Thomas %A Stéphan Clemencon %A Alexandre Gramfort %A Anne Sabourin %B Proceedings of the 20th International Conference on Artificial Intelligence and Statistics %C Proceedings of Machine Learning Research %D 2017 %E Aarti Singh %E Jerry Zhu %F pmlr-v54-thomas17a %I PMLR %P 1011--1019 %U https://proceedings.mlr.press/v54/thomas17a.html %V 54 %X Extreme regions in the feature space are of particular concern for anomaly detection: anomalies are likely to be located in the tails, whereas data scarcity in such regions makes it difficult to distinguish between large normal instances and anomalies. This paper presents an unsupervised algorithm for anomaly detection in extreme regions. We propose a Minimum Volume set (MV-set) approach relying on multivariate extreme value theory. This framework includes a canonical pre-processing step, which addresses the issue of output sensitivity to standardization choices. The resulting data representation on the sphere highlights the dependence structure of the extremal observations. Anomaly detection is then cast as a MV-set estimation problem on the sphere, where volume is measured by the spherical measure and mass refers to the angular measure. An anomaly then corresponds to an unusual observation given that one of its variables is large. A preliminary rate bound analysis is carried out for the learning method we introduce and its computational advantages are discussed and illustrated by numerical experiments.
APA
Thomas, A., Clemencon, S., Gramfort, A. & Sabourin, A.. (2017). Anomaly Detection in Extreme Regions via Empirical MV-sets on the Sphere. Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 54:1011-1019 Available from https://proceedings.mlr.press/v54/thomas17a.html.

Related Material