Efficient Second-Order Gradient Boosting for Conditional Random Fields

Tianqi Chen; Sameer Singh; Ben Taskar; Carlos Guestrin

Efficient Second-Order Gradient Boosting for Conditional Random Fields

Tianqi Chen, Sameer Singh, Ben Taskar, Carlos Guestrin

Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, PMLR 38:147-155, 2015.

Abstract

Conditional random fields (CRFs) are an important class of models for accurate structured prediction, but effective design of the feature functions is a major challenge when applying CRF models to real world data. Gradient boosting, which is used to automatically induce and select feature functions, is a natural candidate solution to the problem. However, it is non-trivial to derive gradient boosting algorithms for CRFs due to the dense Hessian matrices introduced by variable dependencies. Existing approaches thus use only first-order information when optimizing likelihood, and hence face convergence issues. We incorporate second-order information by deriving a Markov Chain mixing rate bound to quantify the dependencies, and introduce a gradient boosting algorithm that iteratively optimizes an adaptive upper bound of the objective function. The resulting algorithm induces and selects features for CRFs via functional space optimization, with provable convergence guarantees. Experimental results on three real world datasets demonstrate that the mixing rate based upper bound is effective for learning CRFs with non-linear potentials.

Cite this Paper

BibTeX


@InProceedings{pmlr-v38-chen15b,
  title = 	 {{Efficient Second-Order Gradient Boosting for Conditional Random Fields}},
  author = 	 {Chen, Tianqi and Singh, Sameer and Taskar, Ben and Guestrin, Carlos},
  booktitle = 	 {Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics},
  pages = 	 {147--155},
  year = 	 {2015},
  editor = 	 {Lebanon, Guy and Vishwanathan, S. V. N.},
  volume = 	 {38},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {San Diego, California, USA},
  month = 	 {09--12 May},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v38/chen15b.pdf},
  url = 	 {https://proceedings.mlr.press/v38/chen15b.html},
  abstract = 	 {Conditional random fields (CRFs) are an important class of models for accurate structured prediction, but effective design of the feature functions is a major challenge when applying CRF models to real world data. Gradient boosting, which is used to automatically induce and select feature functions, is a natural candidate solution to the problem.  However, it is non-trivial to derive gradient boosting algorithms for CRFs due to the dense Hessian matrices introduced by variable dependencies. Existing approaches thus use only first-order information when optimizing likelihood, and hence face convergence issues. We incorporate second-order information by deriving a Markov Chain mixing rate bound to quantify the dependencies, and introduce a gradient boosting algorithm that iteratively optimizes an adaptive upper bound of the objective function.  The resulting algorithm induces and selects features for CRFs via functional space optimization, with provable convergence guarantees.  Experimental results on three real world datasets demonstrate that the mixing rate based upper bound is effective for learning CRFs with non-linear potentials.}
}

Endnote

%0 Conference Paper
%T Efficient Second-Order Gradient Boosting for Conditional Random Fields
%A Tianqi Chen
%A Sameer Singh
%A Ben Taskar
%A Carlos Guestrin
%B Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics
%C Proceedings of Machine Learning Research
%D 2015
%E Guy Lebanon
%E S. V. N. Vishwanathan	
%F pmlr-v38-chen15b
%I PMLR
%P 147--155
%U https://proceedings.mlr.press/v38/chen15b.html
%V 38
%X Conditional random fields (CRFs) are an important class of models for accurate structured prediction, but effective design of the feature functions is a major challenge when applying CRF models to real world data. Gradient boosting, which is used to automatically induce and select feature functions, is a natural candidate solution to the problem.  However, it is non-trivial to derive gradient boosting algorithms for CRFs due to the dense Hessian matrices introduced by variable dependencies. Existing approaches thus use only first-order information when optimizing likelihood, and hence face convergence issues. We incorporate second-order information by deriving a Markov Chain mixing rate bound to quantify the dependencies, and introduce a gradient boosting algorithm that iteratively optimizes an adaptive upper bound of the objective function.  The resulting algorithm induces and selects features for CRFs via functional space optimization, with provable convergence guarantees.  Experimental results on three real world datasets demonstrate that the mixing rate based upper bound is effective for learning CRFs with non-linear potentials.

RIS


TY  - CPAPER
TI  - Efficient Second-Order Gradient Boosting for Conditional Random Fields
AU  - Tianqi Chen
AU  - Sameer Singh
AU  - Ben Taskar
AU  - Carlos Guestrin
BT  - Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics
DA  - 2015/02/21
ED  - Guy Lebanon
ED  - S. V. N. Vishwanathan	
ID  - pmlr-v38-chen15b
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 38
SP  - 147
EP  - 155
L1  - http://proceedings.mlr.press/v38/chen15b.pdf
UR  - https://proceedings.mlr.press/v38/chen15b.html
AB  - Conditional random fields (CRFs) are an important class of models for accurate structured prediction, but effective design of the feature functions is a major challenge when applying CRF models to real world data. Gradient boosting, which is used to automatically induce and select feature functions, is a natural candidate solution to the problem.  However, it is non-trivial to derive gradient boosting algorithms for CRFs due to the dense Hessian matrices introduced by variable dependencies. Existing approaches thus use only first-order information when optimizing likelihood, and hence face convergence issues. We incorporate second-order information by deriving a Markov Chain mixing rate bound to quantify the dependencies, and introduce a gradient boosting algorithm that iteratively optimizes an adaptive upper bound of the objective function.  The resulting algorithm induces and selects features for CRFs via functional space optimization, with provable convergence guarantees.  Experimental results on three real world datasets demonstrate that the mixing rate based upper bound is effective for learning CRFs with non-linear potentials.
ER  -

APA


Chen, T., Singh, S., Taskar, B. & Guestrin, C.. (2015). Efficient Second-Order Gradient Boosting for Conditional Random Fields. Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 38:147-155 Available from https://proceedings.mlr.press/v38/chen15b.html.

Efficient Second-Order Gradient Boosting for Conditional Random Fields

Abstract

Cite this Paper

Related Material