ChiralCat: Molecular chirality classification with enhanced spatial representation using learnable queries

Molecular chirality is a key focus of research in chemistry and biology. In nature, there are many complex categories of chirality and it can strongly alter biochemical activities and interactions, particularly in asymmetric catalysis and protein–drug binding. Despite advancements in molecular prope...

Full description

Saved in:
Bibliographic Details
Main Authors: Yichuan Peng, Gufeng Yu, Runhan Shi, Letian Chen, Xi Wang, Wenjie Du, Xiaohong Huo, Yang Yang
Format: Article
Language:English
Published: Elsevier 2025-12-01
Series:Artificial Intelligence Chemistry
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2949747725000089
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Molecular chirality is a key focus of research in chemistry and biology. In nature, there are many complex categories of chirality and it can strongly alter biochemical activities and interactions, particularly in asymmetric catalysis and protein–drug binding. Despite advancements in molecular property prediction approaches, a computational method capable of identifying chiral types has been absent, impeding progress in chirality studies. This gap is primarily due to the inability of current molecular representation models to capture chiral-related spatial features and the scarcity of annotated datasets for complex chiral types. To address these limitations, we develop ChiralCat, a pioneering machine learning method for molecular chirality classification. ChiralCat’s core is a pre-trained multi-modal classifier that enhances spatial molecular representations. This is achieved through learnable queries, guided by chirality-related descriptions generated by a large language model (LLM). To facilitate the model’s training, we construct an extensive chiral molecule dataset comprising 17,181 molecules across various chiral categories. Our experimental results, both quantitative and visualized, reveal that ChiralCat outperforms existing 3D molecular representation learning models in capturing spatial information pertinent to chirality, thereby exhibiting superior capability in discerning complex chiral types.
ISSN:2949-7477