Feature Extraction for Machine Learning: Logic-Probabilistic Approach

Vladimir Gorodetsky, Vladimir Samoylov
Proceedings of the Fourth International Workshop on Feature Selection in Data Mining, PMLR 10:55-65, 2010.

Abstract

The paper analyzes the peculiarities of preprocessing learning data stored in object databases that consist of multiple relational tables with an ontology on top. Such learning data structures are typical of many novel, challenging applications. The paper proposes a new technology, supported by a number of novel algorithms, for ontology-centered transformation of heterogeneous, possibly poorly structured learning data into a homogeneous, informative binary feature space. The transformation is based on (1) aggregation of the ontology notion instances and their attribute domains and (2) subsequent probabilistic cause-consequence analysis aimed at extracting more informative features. The proposed technology is fully implemented and validated on several case studies.

Cite this Paper


BibTeX
@InProceedings{pmlr-v10-gorodetsky10a,
  title = {Feature Extraction for Machine Learning: Logic-Probabilistic Approach},
  author = {Gorodetsky, Vladimir and Samoylov, Vladimir},
  booktitle = {Proceedings of the Fourth International Workshop on Feature Selection in Data Mining},
  pages = {55--65},
  year = {2010},
  editor = {Liu, Huan and Motoda, Hiroshi and Setiono, Rudy and Zhao, Zheng},
  volume = {10},
  series = {Proceedings of Machine Learning Research},
  address = {Hyderabad, India},
  month = {21 Jun},
  publisher = {PMLR},
  pdf = {http://proceedings.mlr.press/v10/gorodetsky10a/gorodetsky10a.pdf},
  url = {https://proceedings.mlr.press/v10/gorodetsky10a.html},
  abstract = {The paper analyzes peculiarities of preprocessing of learning data represented in object data bases constituted by multiple relational tables with ontology on top of it. Exactly such learning data structures are peculiar to many novel challenging applications. The paper proposes a new technology supported by a number of novel algorithms intended for ontology-centered transformation of heterogeneous possibly poor structured learning data into homogeneous informative binary feature space based on (1) aggregation of the ontology notion instances and their attribute domains and subsequent probabilistic cause-consequence analysis aimed at extraction more informative features. The proposed technology is fully implemented and validated on several case studies.}
}
Endnote
%0 Conference Paper
%T Feature Extraction for Machine Learning: Logic-Probabilistic Approach
%A Vladimir Gorodetsky
%A Vladimir Samoylov
%B Proceedings of the Fourth International Workshop on Feature Selection in Data Mining
%C Proceedings of Machine Learning Research
%D 2010
%E Huan Liu
%E Hiroshi Motoda
%E Rudy Setiono
%E Zheng Zhao
%F pmlr-v10-gorodetsky10a
%I PMLR
%P 55--65
%U https://proceedings.mlr.press/v10/gorodetsky10a.html
%V 10
%X The paper analyzes peculiarities of preprocessing of learning data represented in object data bases constituted by multiple relational tables with ontology on top of it. Exactly such learning data structures are peculiar to many novel challenging applications. The paper proposes a new technology supported by a number of novel algorithms intended for ontology-centered transformation of heterogeneous possibly poor structured learning data into homogeneous informative binary feature space based on (1) aggregation of the ontology notion instances and their attribute domains and subsequent probabilistic cause-consequence analysis aimed at extraction more informative features. The proposed technology is fully implemented and validated on several case studies.
RIS
TY  - CPAPER
TI  - Feature Extraction for Machine Learning: Logic-Probabilistic Approach
AU  - Vladimir Gorodetsky
AU  - Vladimir Samoylov
BT  - Proceedings of the Fourth International Workshop on Feature Selection in Data Mining
DA  - 2010/05/26
ED  - Huan Liu
ED  - Hiroshi Motoda
ED  - Rudy Setiono
ED  - Zheng Zhao
ID  - pmlr-v10-gorodetsky10a
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 10
SP  - 55
EP  - 65
L1  - http://proceedings.mlr.press/v10/gorodetsky10a/gorodetsky10a.pdf
UR  - https://proceedings.mlr.press/v10/gorodetsky10a.html
AB  - The paper analyzes peculiarities of preprocessing of learning data represented in object data bases constituted by multiple relational tables with ontology on top of it. Exactly such learning data structures are peculiar to many novel challenging applications. The paper proposes a new technology supported by a number of novel algorithms intended for ontology-centered transformation of heterogeneous possibly poor structured learning data into homogeneous informative binary feature space based on (1) aggregation of the ontology notion instances and their attribute domains and subsequent probabilistic cause-consequence analysis aimed at extraction more informative features. The proposed technology is fully implemented and validated on several case studies.
ER  -
APA
Gorodetsky, V. & Samoylov, V. (2010). Feature Extraction for Machine Learning: Logic-Probabilistic Approach. Proceedings of the Fourth International Workshop on Feature Selection in Data Mining, in Proceedings of Machine Learning Research 10:55-65. Available from https://proceedings.mlr.press/v10/gorodetsky10a.html.
