A Convex Atomic-Norm Approach to Multiple Sequence Alignment and Motif Discovery

Ian En-Hsu Yen; Xin Lin; Jiong Zhang; Pradeep Ravikumar; Inderjit Dhillon

A Convex Atomic-Norm Approach to Multiple Sequence Alignment and Motif Discovery

Ian En-Hsu Yen, Xin Lin, Jiong Zhang, Pradeep Ravikumar, Inderjit Dhillon

Proceedings of The 33rd International Conference on Machine Learning, PMLR 48:2272-2280, 2016.

Abstract

Multiple Sequence Alignment and Motif Discovery, known as NP-hard problems, are two fundamental tasks in Bioinformatics. Existing approaches to these two problems are based on either local search methods such as Expectation Maximization (EM), Gibbs Sampling or greedy heuristic methods. In this work, we develop a convex relaxation approach to both problems based on the recent concept of atomic norm and develop a new algorithm, termed Greedy Direction Method of Multiplier, for solving the convex relaxation with two convex atomic constraints. Experiments show that our convex relaxation approach produces solutions of higher quality than those standard tools widely-used in Bioinformatics community on the Multiple Sequence Alignment and Motif Discovery problems.

Cite this Paper

BibTeX


@InProceedings{pmlr-v48-yena16,
  title = 	 {A Convex Atomic-Norm Approach to Multiple Sequence Alignment and Motif Discovery},
  author = 	 {Yen, Ian En-Hsu and Lin, Xin and Zhang, Jiong and Ravikumar, Pradeep and Dhillon, Inderjit},
  booktitle = 	 {Proceedings of The 33rd International Conference on Machine Learning},
  pages = 	 {2272--2280},
  year = 	 {2016},
  editor = 	 {Balcan, Maria Florina and Weinberger, Kilian Q.},
  volume = 	 {48},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {New York, New York, USA},
  month = 	 {20--22 Jun},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v48/yena16.pdf},
  url = 	 {https://proceedings.mlr.press/v48/yena16.html},
  abstract = 	 {Multiple Sequence Alignment and Motif Discovery, known as NP-hard problems, are two fundamental tasks in Bioinformatics. Existing approaches to these two problems are based on either local search methods such as Expectation Maximization (EM), Gibbs Sampling or greedy heuristic methods. In this work, we develop a convex relaxation approach to both problems based on the recent concept of atomic norm and develop a new algorithm, termed Greedy Direction Method of Multiplier, for solving the convex relaxation with two convex atomic constraints. Experiments show that our convex relaxation approach produces solutions of higher quality than those standard tools widely-used in Bioinformatics community on the Multiple Sequence Alignment and Motif Discovery problems.}
}

Endnote

%0 Conference Paper
%T A Convex Atomic-Norm Approach to Multiple Sequence Alignment and Motif Discovery
%A Ian En-Hsu Yen
%A Xin Lin
%A Jiong Zhang
%A Pradeep Ravikumar
%A Inderjit Dhillon
%B Proceedings of The 33rd International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2016
%E Maria Florina Balcan
%E Kilian Q. Weinberger	
%F pmlr-v48-yena16
%I PMLR
%P 2272--2280
%U https://proceedings.mlr.press/v48/yena16.html
%V 48
%X Multiple Sequence Alignment and Motif Discovery, known as NP-hard problems, are two fundamental tasks in Bioinformatics. Existing approaches to these two problems are based on either local search methods such as Expectation Maximization (EM), Gibbs Sampling or greedy heuristic methods. In this work, we develop a convex relaxation approach to both problems based on the recent concept of atomic norm and develop a new algorithm, termed Greedy Direction Method of Multiplier, for solving the convex relaxation with two convex atomic constraints. Experiments show that our convex relaxation approach produces solutions of higher quality than those standard tools widely-used in Bioinformatics community on the Multiple Sequence Alignment and Motif Discovery problems.

RIS


TY  - CPAPER
TI  - A Convex Atomic-Norm Approach to Multiple Sequence Alignment and Motif Discovery
AU  - Ian En-Hsu Yen
AU  - Xin Lin
AU  - Jiong Zhang
AU  - Pradeep Ravikumar
AU  - Inderjit Dhillon
BT  - Proceedings of The 33rd International Conference on Machine Learning
DA  - 2016/06/11
ED  - Maria Florina Balcan
ED  - Kilian Q. Weinberger	
ID  - pmlr-v48-yena16
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 48
SP  - 2272
EP  - 2280
L1  - http://proceedings.mlr.press/v48/yena16.pdf
UR  - https://proceedings.mlr.press/v48/yena16.html
AB  - Multiple Sequence Alignment and Motif Discovery, known as NP-hard problems, are two fundamental tasks in Bioinformatics. Existing approaches to these two problems are based on either local search methods such as Expectation Maximization (EM), Gibbs Sampling or greedy heuristic methods. In this work, we develop a convex relaxation approach to both problems based on the recent concept of atomic norm and develop a new algorithm, termed Greedy Direction Method of Multiplier, for solving the convex relaxation with two convex atomic constraints. Experiments show that our convex relaxation approach produces solutions of higher quality than those standard tools widely-used in Bioinformatics community on the Multiple Sequence Alignment and Motif Discovery problems.
ER  -

APA


Yen, I.E., Lin, X., Zhang, J., Ravikumar, P. & Dhillon, I.. (2016). A Convex Atomic-Norm Approach to Multiple Sequence Alignment and Motif Discovery. Proceedings of The 33rd International Conference on Machine Learning, in Proceedings of Machine Learning Research 48:2272-2280 Available from https://proceedings.mlr.press/v48/yena16.html.

A Convex Atomic-Norm Approach to Multiple Sequence Alignment and Motif Discovery

Abstract

Cite this Paper

Related Material