Enumerating Distinct Decision Trees

Salvatore Ruggieri
Proceedings of the 34th International Conference on Machine Learning, PMLR 70:2960-2968, 2017.

Abstract

The search space for the feature selection problem in decision tree learning is the lattice of subsets of the available features. We provide an exact enumeration procedure of the subsets that lead to all and only the distinct decision trees. The procedure can be adopted to prune the search space of complete and heuristic search methods in wrapper models for feature selection. Based on this, we design a computational optimization of the sequential backward elimination heuristic with a performance improvement of up to 100x.
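
For readers unfamiliar with the baseline, the following is a minimal sketch of plain sequential backward elimination in a wrapper setting. It is not the enumeration procedure or the optimization described in the abstract. The names `sequential_backward_elimination` and `toy_score` are hypothetical, and the scoring function is a stand-in: in a real wrapper it would train a decision tree on the given feature subset and return an estimated accuracy.

```python
from typing import Callable, FrozenSet, Iterable, Tuple


def sequential_backward_elimination(
    features: Iterable[str],
    score: Callable[[FrozenSet[str]], float],
) -> Tuple[FrozenSet[str], float]:
    """Greedy descent in the subset lattice: drop one feature per step."""
    current = frozenset(features)
    current_score = score(current)
    while len(current) > 1:
        # Score every subset obtained by removing exactly one feature.
        candidates = [(score(current - {f}), f) for f in sorted(current)]
        best_score, drop = max(candidates)
        # Stop as soon as every single removal makes the score worse.
        if best_score < current_score:
            break
        current, current_score = current - {drop}, best_score
    return current, current_score


def toy_score(subset: FrozenSet[str]) -> float:
    """Stand-in evaluator (assumption): rewards the relevant features
    'a' and 'b' and slightly penalizes subset size."""
    relevant = {"a", "b"}
    return len(subset & relevant) - 0.01 * len(subset)


selected, final_score = sequential_backward_elimination(
    ["a", "b", "c", "d"], toy_score
)
print(sorted(selected))  # ['a', 'b']
```

Each step of this loop scores one candidate subset per remaining feature, so a wrapper that retrains a tree for every candidate repeats work whenever different subsets induce the same tree. That redundancy is what the abstract's pruning targets.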
