Enzyme sequence optimisation via Gromov-Wasserstein Autoencoders integrating MSA techniques
Enzyme sequence design has always been a challenging task, particularly in optimising key properties such as enzyme solubility, stability, and activity. This study proposes an innovative approach by utilising a variational autoencoder (VAE) model integrated with the Gromov-Wasserstein (GW) distance...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Taylor & Francis Group
2025-12-01
|
Series: | Journal of Enzyme Inhibition and Medicinal Chemistry |
Subjects: | |
Online Access: | https://www.tandfonline.com/doi/10.1080/14756366.2025.2524742 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Enzyme sequence design has always been a challenging task, particularly in optimising key properties such as enzyme solubility, stability, and activity. This study proposes an innovative approach by utilising a variational autoencoder (VAE) model integrated with the Gromov-Wasserstein (GW) distance for enzyme sequence optimisation. The GWAE model improves representation learning by using the GW distance, thereby generating functional variants with desired characteristics. We also introduce an innovative enzyme dataset construction method that incorporates multiple sequence alignment (MSA) techniques to address sequence length discrepancies, enhancing the accuracy of the optimisation process. Experimental results show that the GWAE model outperforms the traditional VAE on multiple metrics. The generated enzyme sequences demonstrate superior solubility, stability, and hydrophobicity. Additionally, by integrating AlphaFold3 for structural prediction, we verify the structural stability of the generated sequences, further enhancing their practical applicability. |
---|---|
ISSN: | 1475-6366 1475-6374 |