Wave-LSTM: Multi-scale analysis of somatic whole genome copy number profiles

Charles Gadd, Christopher Yau
Proceedings of the 19th Machine Learning in Computational Biology meeting, PMLR 261:28-37, 2024.

Abstract

Changes in the number of copies of certain parts of the genome, known as copy number alterations (CNAs), due to somatic mutation processes are a hallmark of many cancers. This genomic complexity is known to be associated with poorer outcomes for patients, but describing its contribution in detail has been difficult. Copy number alterations can affect large regions spanning whole chromosomes or the entire genome itself, but can also be localised to only small segments of the genome, and no methods exist that allow this multi-scale nature to be quantified. In this paper, we address this using Wave-LSTM, a signal decomposition approach designed to capture the multi-scale structure of complex whole genome copy number profiles. Using wavelet-based source separation in combination with deep learning-based attention mechanisms, we show that Wave-LSTM can be used to derive multi-scale representations from copy number profiles. These representations can be used to decipher sub-clonal structures from single-cell copy number data and to improve survival prediction performance from patient tumour profiles.
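To make the multi-scale idea concrete, the following is a minimal sketch of a plain Haar wavelet decomposition applied to two toy copy number profiles. It is not the Wave-LSTM model and is not taken from the paper: the Haar basis, the function names (`haar_step`, `scale_energies`) and the 16-bin profiles are all assumptions chosen for illustration only. It shows that a broad alteration and a focal one distribute their signal energy differently across scales.

```python
import math


def haar_step(signal):
    """One level of the orthonormal Haar transform: returns (approximation, detail)."""
    s = 1.0 / math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) * s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) * s for i in range(0, len(signal), 2)]
    return approx, detail


def scale_energies(profile, levels):
    """Sum of squared Haar detail coefficients per level, finest scale first."""
    if len(profile) % (2 ** levels) != 0:
        raise ValueError("profile length must be divisible by 2**levels")
    approx = list(profile)
    energies = []
    for _ in range(levels):
        approx, detail = haar_step(approx)
        energies.append(sum(c * c for c in detail))
    return energies


# Hypothetical toy profiles over 16 genomic bins (diploid baseline = 2 copies).
broad_gain = [2.0] * 8 + [3.0] * 8   # one extra copy across half of the region
focal_amp = [2.0] * 16
focal_amp[2] = 6.0                   # high-level amplification in a single bin

print("broad gain :", [round(e, 6) for e in scale_energies(broad_gain, 4)])
print("focal amp  :", [round(e, 6) for e in scale_energies(focal_amp, 4)])
```

In this toy setting the broad gain contributes only at the coarsest level, while the focal amplification is strongest at the finest level and decays towards coarser ones. A learned model would operate on such per-scale components rather than on raw energies; how Wave-LSTM does so is described in the paper itself.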

Cite this Paper


BibTeX
@InProceedings{pmlr-v261-gadd24a,
  title = {Wave-LSTM: Multi-scale analysis of somatic whole genome copy number profiles},
  author = {Gadd, Charles and Yau, Christopher},
  booktitle = {Proceedings of the 19th Machine Learning in Computational Biology meeting},
  pages = {28--37},
  year = {2024},
  editor = {Knowles, David A and Mostafavi, Sara},
  volume = {261},
  series = {Proceedings of Machine Learning Research},
  month = {05--06 Sep},
  publisher = {PMLR},
  pdf = {https://raw.githubusercontent.com/mlresearch/v261/main/assets/gadd24a/gadd24a.pdf},
  url = {https://proceedings.mlr.press/v261/gadd24a.html},
  abstract = {Changes in the number of copies of certain parts of the genome, known as copy number alterations (CNAs), due to somatic mutation processes are a hallmark of many cancers. This genomic complexity is known to be associated with poorer outcomes for patients but describing its contribution in detail has been difficult. Copy number alterations can affect large regions spanning whole chromosomes or the entire genome itself but can also be localised to only small segments of the genome and no methods exist that allow this multi-scale nature to be quantified. In this paper, we address this using Wave-LSTM, a signal decomposition approach designed to capture the multi-scale structure of complex whole genome copy number profiles. Using wavelet-based source separation in combination with deep learning-based attention mechanisms. We show that Wave-LSTM can be used to derive multi-scale representations from copy number profiles which can be used to decipher sub-clonal structures from single-cell copy number data and to improve survival prediction performance from patient tumour profiles.}
}
Endnote
%0 Conference Paper
%T Wave-LSTM: Multi-scale analysis of somatic whole genome copy number profiles
%A Charles Gadd
%A Christopher Yau
%B Proceedings of the 19th Machine Learning in Computational Biology meeting
%C Proceedings of Machine Learning Research
%D 2024
%E David A Knowles
%E Sara Mostafavi
%F pmlr-v261-gadd24a
%I PMLR
%P 28--37
%U https://proceedings.mlr.press/v261/gadd24a.html
%V 261
%X Changes in the number of copies of certain parts of the genome, known as copy number alterations (CNAs), due to somatic mutation processes are a hallmark of many cancers. This genomic complexity is known to be associated with poorer outcomes for patients but describing its contribution in detail has been difficult. Copy number alterations can affect large regions spanning whole chromosomes or the entire genome itself but can also be localised to only small segments of the genome and no methods exist that allow this multi-scale nature to be quantified. In this paper, we address this using Wave-LSTM, a signal decomposition approach designed to capture the multi-scale structure of complex whole genome copy number profiles. Using wavelet-based source separation in combination with deep learning-based attention mechanisms. We show that Wave-LSTM can be used to derive multi-scale representations from copy number profiles which can be used to decipher sub-clonal structures from single-cell copy number data and to improve survival prediction performance from patient tumour profiles.
APA
Gadd, C. & Yau, C. (2024). Wave-LSTM: Multi-scale analysis of somatic whole genome copy number profiles. Proceedings of the 19th Machine Learning in Computational Biology meeting, in Proceedings of Machine Learning Research 261:28-37. Available from https://proceedings.mlr.press/v261/gadd24a.html.
