Graph learning for capturing long-range dependencies in protein structures

Ali Hariri, Pierre Vandergheynst
Proceedings of the 19th Machine Learning in Computational Biology meeting, PMLR 261:117-128, 2024.

Abstract

Polyamides, or peptides and proteins, are biomolecules that exist in a broad spectrum of size, structure, and function. Both structure and function are defined by the underlying sequence of amino acids, causing the polyamide to take three-dimensional conformations when in solution. Despite significant efforts and advances in function and conformation prediction, there remains a critical need for computational methods to accurately infer protein function from sequence and structure. Recent advancements in deep learning, particularly Graph Neural Networks, have shown promise in learning the sequence and structure of proteins. However, they fail to capture essential long-range dependencies inherent in the complex and dynamic three-dimensional structures of proteins, leading to issues including oversquashing and oversmoothing. Here, we explore solutions to the challenge of capturing long-range dependencies in graph representations of polyamides, focusing on latent nodes and graph rewiring techniques. While graph rewiring enhances information flow between distant nodes, latent nodes enable the concentration of global information. In addition, we investigate the effectiveness of ChebNet, a spectral backbone, in capturing long-range dependencies. Our unified framework combines these approaches to address the limitations of current methods, offering insights into protein function and regulation. Through experimental analysis, we demonstrate the efficacy of our proposed methods in capturing long-range dependencies.

Cite this Paper


BibTeX
@InProceedings{pmlr-v261-hariri24a, title = {Graph learning for capturing long-range dependencies in protein structures}, author = {Hariri, Ali and Vandergheynst, Pierre}, booktitle = {Proceedings of the 19th Machine Learning in Computational Biology meeting}, pages = {117--128}, year = {2024}, editor = {Knowles, David A and Mostafavi, Sara}, volume = {261}, series = {Proceedings of Machine Learning Research}, month = {05--06 Sep}, publisher = {PMLR}, pdf = {https://raw.githubusercontent.com/mlresearch/v261/main/assets/hariri24a/hariri24a.pdf}, url = {https://proceedings.mlr.press/v261/hariri24a.html}, abstract = {Polyamides, or peptides and proteins, are biomolecules that exist in a broad spectrum of size, structure, and function. Both structure and function are defined by the underlying sequence of amino acids, causing the polyamide to take three-dimensional conformations when in solution. Despite significant efforts and advances in function and conformation prediction, there remains a critical need for computational methods to accurately infer protein function from sequence and structure. Recent advancements in deep learning, particularly Graph Neural Networks, have shown promise in learning the sequence and structure of proteins. However, they fail to capture essential long-range dependencies inherent in the complex and dynamic three-dimensional structures of proteins, leading to issues including oversquashing and oversmoothing. Here, we explore solutions to the challenge of capturing long-range dependencies in graph representations of polyamides, focusing on latent nodes and graph rewiring techniques. While graph rewiring enhances information flow between distant nodes, latent nodes enable the concentration of global information. In addition, we investigate the effectiveness of ChebNet, a spectral backbone, in capturing long-range dependencies. Our unified framework combines these approaches to address the limitations of current methods, offering insights into protein function and regulation. Through experimental analysis, we demonstrate the efficacy of our proposed methods in capturing long-range dependencies.} }
Endnote
%0 Conference Paper %T Graph learning for capturing long-range dependencies in protein structures %A Ali Hariri %A Pierre Vandergheynst %B Proceedings of the 19th Machine Learning in Computational Biology meeting %C Proceedings of Machine Learning Research %D 2024 %E David A Knowles %E Sara Mostafavi %F pmlr-v261-hariri24a %I PMLR %P 117--128 %U https://proceedings.mlr.press/v261/hariri24a.html %V 261 %X Polyamides, or peptides and proteins, are biomolecules that exist in a broad spectrum of size, structure, and function. Both structure and function are defined by the underlying sequence of amino acids, causing the polyamide to take three-dimensional conformations when in solution. Despite significant efforts and advances in function and conformation prediction, there remains a critical need for computational methods to accurately infer protein function from sequence and structure. Recent advancements in deep learning, particularly Graph Neural Networks, have shown promise in learning the sequence and structure of proteins. However, they fail to capture essential long-range dependencies inherent in the complex and dynamic three-dimensional structures of proteins, leading to issues including oversquashing and oversmoothing. Here, we explore solutions to the challenge of capturing long-range dependencies in graph representations of polyamides, focusing on latent nodes and graph rewiring techniques. While graph rewiring enhances information flow between distant nodes, latent nodes enable the concentration of global information. In addition, we investigate the effectiveness of ChebNet, a spectral backbone, in capturing long-range dependencies. Our unified framework combines these approaches to address the limitations of current methods, offering insights into protein function and regulation. Through experimental analysis, we demonstrate the efficacy of our proposed methods in capturing long-range dependencies.
APA
Hariri, A. & Vandergheynst, P.. (2024). Graph learning for capturing long-range dependencies in protein structures. Proceedings of the 19th Machine Learning in Computational Biology meeting, in Proceedings of Machine Learning Research 261:117-128 Available from https://proceedings.mlr.press/v261/hariri24a.html.

Related Material