Clifford Group Equivariant Neural Network Layers for Protein Structure Prediction

Alberto Pepe, Sven Buchholz, Joan Lasenby
Proceedings of the 5th Northern Lights Deep Learning Conference ({NLDL}), PMLR 233:205-211, 2024.

Abstract

We employ Clifford Group Equivariant Neural Network (CGENN) layers to predict protein coordinates in a Protein Structure Prediction (PSP) pipeline. PSP is the estimation of the 3D structure of a protein, generally through deep learning architectures. Information about the geometry of the protein chain has been proven to be crucial for accurate predictions of 3D structures. However, this information is usually flattened as machine learning features that are not representative of the geometric nature of the problem. Leveraging recent advances in geometric deep learning, we redesign a PSP architecture with the addition of CGENN layers. CGENNs can achieve better generalization and robustness when dealing with data that show rotational or translational invariance such as protein coordinates, which are independent of the chosen reference frame. CGENNs inputs, outputs, weights and biases are objects in the Geometric Algebra of 3D Euclidean space, i.e. $\mathcal{G}_{3,0,0}$, and hence are interpretable from a geometrical perspective. We test 6 approaches to PSP and show that CGENN layers increase the prediction accuracy by up to 2.1%, with fewer trainable parameters compared to linear layers and give a clear geometric interpretation of their outputs.

Cite this Paper


BibTeX
@InProceedings{pmlr-v233-pepe24a, title = {Clifford Group Equivariant Neural Network Layers for Protein Structure Prediction}, author = {Pepe, Alberto and Buchholz, Sven and Lasenby, Joan}, booktitle = {Proceedings of the 5th Northern Lights Deep Learning Conference ({NLDL})}, pages = {205--211}, year = {2024}, editor = {Lutchyn, Tetiana and Ramírez Rivera, Adín and Ricaud, Benjamin}, volume = {233}, series = {Proceedings of Machine Learning Research}, month = {09--11 Jan}, publisher = {PMLR}, pdf = {https://proceedings.mlr.press/v233/pepe24a/pepe24a.pdf}, url = {https://proceedings.mlr.press/v233/pepe24a.html}, abstract = {We employ Clifford Group Equivariant Neural Network (CGENN) layers to predict protein coordinates in a Protein Structure Prediction (PSP) pipeline. PSP is the estimation of the 3D structure of a protein, generally through deep learning architectures. Information about the geometry of the protein chain has been proven to be crucial for accurate predictions of 3D structures. However, this information is usually flattened as machine learning features that are not representative of the geometric nature of the problem. Leveraging recent advances in geometric deep learning, we redesign a PSP architecture with the addition of CGENN layers. CGENNs can achieve better generalization and robustness when dealing with data that show rotational or translational invariance such as protein coordinates, which are independent of the chosen reference frame. CGENNs inputs, outputs, weights and biases are objects in the Geometric Algebra of 3D Euclidean space, i.e. $\mathcal{G}_{3,0,0}$, and hence are interpretable from a geometrical perspective. We test 6 approaches to PSP and show that CGENN layers increase the prediction accuracy by up to 2.1%, with fewer trainable parameters compared to linear layers and give a clear geometric interpretation of their outputs.} }
Endnote
%0 Conference Paper %T Clifford Group Equivariant Neural Network Layers for Protein Structure Prediction %A Alberto Pepe %A Sven Buchholz %A Joan Lasenby %B Proceedings of the 5th Northern Lights Deep Learning Conference ({NLDL}) %C Proceedings of Machine Learning Research %D 2024 %E Tetiana Lutchyn %E Adín Ramírez Rivera %E Benjamin Ricaud %F pmlr-v233-pepe24a %I PMLR %P 205--211 %U https://proceedings.mlr.press/v233/pepe24a.html %V 233 %X We employ Clifford Group Equivariant Neural Network (CGENN) layers to predict protein coordinates in a Protein Structure Prediction (PSP) pipeline. PSP is the estimation of the 3D structure of a protein, generally through deep learning architectures. Information about the geometry of the protein chain has been proven to be crucial for accurate predictions of 3D structures. However, this information is usually flattened as machine learning features that are not representative of the geometric nature of the problem. Leveraging recent advances in geometric deep learning, we redesign a PSP architecture with the addition of CGENN layers. CGENNs can achieve better generalization and robustness when dealing with data that show rotational or translational invariance such as protein coordinates, which are independent of the chosen reference frame. CGENNs inputs, outputs, weights and biases are objects in the Geometric Algebra of 3D Euclidean space, i.e. $\mathcal{G}_{3,0,0}$, and hence are interpretable from a geometrical perspective. We test 6 approaches to PSP and show that CGENN layers increase the prediction accuracy by up to 2.1%, with fewer trainable parameters compared to linear layers and give a clear geometric interpretation of their outputs.
APA
Pepe, A., Buchholz, S. & Lasenby, J.. (2024). Clifford Group Equivariant Neural Network Layers for Protein Structure Prediction. Proceedings of the 5th Northern Lights Deep Learning Conference ({NLDL}), in Proceedings of Machine Learning Research 233:205-211 Available from https://proceedings.mlr.press/v233/pepe24a.html.

Related Material