Enzyme sequence optimisation via Gromov-Wasserstein Autoencoders integrating MSA techniques

Enzyme sequence design has always been a challenging task, particularly in optimising key properties such as enzyme solubility, stability, and activity. This study proposes an innovative approach by utilising a variational autoencoder (VAE) model integrated with the Gromov-Wasserstein (GW) distance...

Full description

Saved in:
Bibliographic Details
Main Authors: Xuze Wang, Yangyang Li, Xiancong Hou, Hao Liu
Format: Article
Language:English
Published: Taylor & Francis Group 2025-12-01
Series:Journal of Enzyme Inhibition and Medicinal Chemistry
Subjects:
Online Access:https://www.tandfonline.com/doi/10.1080/14756366.2025.2524742
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Enzyme sequence design has always been a challenging task, particularly in optimising key properties such as enzyme solubility, stability, and activity. This study proposes an innovative approach by utilising a variational autoencoder (VAE) model integrated with the Gromov-Wasserstein (GW) distance for enzyme sequence optimisation. The GWAE model improves representation learning by using the GW distance, thereby generating functional variants with desired characteristics. We also introduce an innovative enzyme dataset construction method that incorporates multiple sequence alignment (MSA) techniques to address sequence length discrepancies, enhancing the accuracy of the optimisation process. Experimental results show that the GWAE model outperforms the traditional VAE on multiple metrics. The generated enzyme sequences demonstrate superior solubility, stability, and hydrophobicity. Additionally, by integrating AlphaFold3 for structural prediction, we verify the structural stability of the generated sequences, further enhancing their practical applicability.
ISSN:1475-6366
1475-6374