ViDa: Visualizing DNA hybridization trajectories with biophysics-informed deep graph embeddings

Chenwei Zhang, Jordan Lovrod, Boyan Beronov, Khanh Dao Duc, Anne Condon
Proceedings of the 18th Machine Learning in Computational Biology meeting, PMLR 240:148-162, 2024.

Abstract

Visualization tools can help synthetic biologists and molecular programmers understand the complex reactive pathways of nucleic acid reactions, which can be designed for many potential applications and can be modeled using a continuous-time Markov chain (CTMC). Here we present ViDa, a new visualization approach for DNA reaction trajectories that uses a 2D embedding of the secondary structure state space underlying the CTMC model. To this end, we integrate a scattering transform of the secondary structure adjacency, a variational autoencoder, and a nonlinear dimensionality reduction method. We augment the training loss with domain-specific supervised terms that capture both thermodynamic and kinetic features. We assess ViDa on two well-studied DNA hybridization reactions. Our results demonstrate that the domain-specific features lead to significant quality improvements over the state-of-the-art in DNA state space visualization, successfully separating different folding pathways and thus providing useful insights into dominant reaction mechanisms.

Cite this Paper


BibTeX
@InProceedings{pmlr-v240-zhang24a, title = {ViDa: Visualizing DNA hybridization trajectories with biophysics-informed deep graph embeddings}, author = {Zhang, Chenwei and Lovrod, Jordan and Beronov, Boyan and Dao Duc, Khanh and Condon, Anne}, booktitle = {Proceedings of the 18th Machine Learning in Computational Biology meeting}, pages = {148--162}, year = {2024}, editor = {Knowles, David A. and Mostafavi, Sara}, volume = {240}, series = {Proceedings of Machine Learning Research}, month = {30 Nov--01 Dec}, publisher = {PMLR}, pdf = {https://proceedings.mlr.press/v240/zhang24a/zhang24a.pdf}, url = {https://proceedings.mlr.press/v240/zhang24a.html}, abstract = {Visualization tools can help synthetic biologists and molecular programmers understand the complex reactive pathways of nucleic acid reactions, which can be designed for many potential applications and can be modeled using a continuous-time Markov chain (CTMC). Here we present ViDa, a new visualization approach for DNA reaction trajectories that uses a 2D embedding of the secondary structure state space underlying the CTMC model. To this end, we integrate a scattering transform of the secondary structure adjacency, a variational autoencoder, and a nonlinear dimensionality reduction method. We augment the training loss with domain-specific supervised terms that capture both thermodynamic and kinetic features. We assess ViDa on two well-studied DNA hybridization reactions. Our results demonstrate that the domain-specific features lead to significant quality improvements over the state-of-the-art in DNA state space visualization, successfully separating different folding pathways and thus providing useful insights into dominant reaction mechanisms.} }
Endnote
%0 Conference Paper %T ViDa: Visualizing DNA hybridization trajectories with biophysics-informed deep graph embeddings %A Chenwei Zhang %A Jordan Lovrod %A Boyan Beronov %A Khanh Dao Duc %A Anne Condon %B Proceedings of the 18th Machine Learning in Computational Biology meeting %C Proceedings of Machine Learning Research %D 2024 %E David A. Knowles %E Sara Mostafavi %F pmlr-v240-zhang24a %I PMLR %P 148--162 %U https://proceedings.mlr.press/v240/zhang24a.html %V 240 %X Visualization tools can help synthetic biologists and molecular programmers understand the complex reactive pathways of nucleic acid reactions, which can be designed for many potential applications and can be modeled using a continuous-time Markov chain (CTMC). Here we present ViDa, a new visualization approach for DNA reaction trajectories that uses a 2D embedding of the secondary structure state space underlying the CTMC model. To this end, we integrate a scattering transform of the secondary structure adjacency, a variational autoencoder, and a nonlinear dimensionality reduction method. We augment the training loss with domain-specific supervised terms that capture both thermodynamic and kinetic features. We assess ViDa on two well-studied DNA hybridization reactions. Our results demonstrate that the domain-specific features lead to significant quality improvements over the state-of-the-art in DNA state space visualization, successfully separating different folding pathways and thus providing useful insights into dominant reaction mechanisms.
APA
Zhang, C., Lovrod, J., Beronov, B., Dao Duc, K. & Condon, A.. (2024). ViDa: Visualizing DNA hybridization trajectories with biophysics-informed deep graph embeddings. Proceedings of the 18th Machine Learning in Computational Biology meeting, in Proceedings of Machine Learning Research 240:148-162 Available from https://proceedings.mlr.press/v240/zhang24a.html.

Related Material