Nvidia Developer 09月25日
NVIDIA ReaSyn:AI驱动的分子合成路径预测模型
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本文介绍了NVIDIA提出的ReaSyn模型,一个用于预测分子合成路径的生成式AI模型。ReaSyn借鉴了大型语言模型(LLMs)的链式思考(Chain-of-Thought, CoT)概念,创新性地提出了链式反应(Chain-of-Reaction, CoR)表示法,将复杂的合成过程分解为一系列可管理的反应步骤。通过这种方式,ReaSyn能够更有效地学习化学反应规则,并生成更可靠的多步合成路径。该模型还结合了强化学习微调和测试时搜索等LLM的推理策略,以增强路径生成的探索性和方向性。ReaSyn在逆合成规划、可合成性目标导向分子优化以及可合成性命中扩展等任务上表现出色,为药物发现等领域提供了强大的工具。

🔬 **链式反应(CoR)表示法革新合成路径预测**:ReaSyn模型的核心创新在于其链式反应(CoR)表示法,该方法受大型语言模型(LLMs)的链式思考(CoT)启发。CoR将复杂的分子合成路径表示为一系列线性的化学反应步骤,每一步都明确包含反应物、反应规则和产物。这种结构化表示不仅模仿了化学家实际的推理过程,还使得模型能够获得每一步的中间监督信号,从而更深入地学习化学反应规则,并生成更可靠的多步合成路径。

🧠 **借鉴LLM推理策略提升模型能力**:ReaSyn将分子合成路径的生成过程视为一种推理任务,并借鉴了LLMs在解决复杂问题时使用的推理策略。具体而言,它采用了两种关键技术:一是基于结果的强化学习(RL)微调,通过奖励机制鼓励模型探索更多样化的合成路径;二是目标导向的测试时搜索,利用束搜索(beam search)等方法,在生成过程中引导模型朝着期望的方向前进,从而提高路径生成的效率和准确性。

🚀 **多功能应用场景赋能分子设计**:ReaSyn模型展现出强大的多功能性,能够应用于多种分子设计和优化任务。它不仅可以用于逆合成规划,为目标分子找到可行的合成路线;还能将不可合成的分子“投影”到可合成的化学空间,生成可实际制备的类似物;同时支持目标导向的分子优化,以实现特定的化学性质;并能应用于命中扩展,快速发现目标分子的多样化可合成类似物,极大地提高了药物发现等领域的效率。

💡 **解决实际化学合成中的关键挑战**:在药物、化学品和材料的开发过程中,分子的可合成性是决定其价值的关键因素。传统的合成路径预测方法存在局限性。ReaSyn通过其创新的生成框架,能够高效地预测分子合成路径,并解决现有方法的不足,为化学家提供了一种强大的工具,以确保设计的分子能够被实际合成出来,从而加速创新进程。

A recurring challenge in molecular design, whether for pharmaceutical, chemical, or material applications, is creating synthesizable molecules. Synthesizability assessment often requires mapping the synthesis pathway for a molecule: the sequence of chemical reactions needed to transform precursor molecules into the target product molecule. This post introduces ReaSyn, a generative model from NVIDIA designed for predicting molecular synthesis pathways that also addresses limitations in current approaches.

Why chain-of-thought reasoning matters for AI in chemistry

Large language models (LLMs) have become integral to daily life, powering applications from virtual assistants to complex problem-solving. Modern LLMs solve complex problems by generating a chain of thought (CoT), which is a series of intermediate reasoning steps that lead to a final answer. Combining CoT and test-time search methods, such as generating multiple CoT paths, are critical to the improved accuracy of recent LLMs.

Chemistry faces a similar challenge in molecular synthesis pathway prediction, where a pathway contains a series of intermediate synthesis steps. Pathway prediction is a critical step in drug, chemical, and materials development because a molecule, however promising, is only valuable if it can‌ be synthesized. ReaSyn is a novel generative framework that efficiently predicts molecular synthesis pathways. It uses a unique chain of reaction (CoR) notation, inspired by the CoT approach in LLMs, combined with a test-time search algorithm.

ReaSyn: treating synthetic pathways as CoR

A synthetic pathway follows a bottom-up tree structure: simple molecules, or building blocks (BB), are combined through chemical reactions (RXN) to produce intermediate products (INT), which in turn undergo further reactions to form increasingly complex molecules (Figure 1). This process is multi-step, with each reaction applied to reactants that may be either building blocks or intermediates. In practice, chemists deduce such pathways step-by-step, reasoning through each transformation to reach the final target molecule.

Figure 1. CoR notation views synthetic pathways as CoT reasoning paths

ReaSyn captures this step-by-step reasoning through its CoR notation, inspired by the CoT approach in LLMs. In CoR, an entire synthetic pathway is represented as a linear sequence where each step explicitly includes the reactants, the reaction rule, and the resulting product. Reactants and products are encoded as SMILES (strings, wrapped with special tokens marking their boundaries), while each reaction is denoted by a single reaction-class token. This representation not only mirrors how chemists think about synthesis but also enables the model to receive intermediate supervision at every step for richer learning of chemical reaction rules and more reliable multi-step pathway generation.

Building on the CoR notation and its sequential design, ReaSyn operates as an autoregressive generative model (Figure 2) that treats synthetic pathways as reasoning chains—each step corresponding to a single chemical reaction. Just as CoT reasoning in LLMs produces intermediate steps before a final answer, ReaSyn incrementally constructs a pathway starting from simple building blocks to a target molecule. This enables it to reconstruct pathways for synthesizable molecules and to project unsynthesizable molecules into synthesizable chemical space, producing close synthesizable analogs that can be made in practice.

After the model predicts the reactants and reaction rules at each step, intermediate products can be easily obtained using a reaction executor such as RDKi. This intermediate information not only provides richer training signals for the model to learn chemical rules, but also guides the synthetic pathway generation process.

Enhancing pathway generation: applying LLM reasoning strategies

Viewing synthetic pathway generation as LLM reasoning also enables us to adopt additional LLM reasoning techniques such as reinforcement learning (RL) finetuning and test-time search. ReaSyn explores these two techniques for synthetic pathway generation.

How outcome-based RL finetuning improves exploration

There can be multiple pathways that lead to the same product molecule. ReaSyn benefits from sampling various synthetic pathways and getting feedback via GRPO). The outcome-based reward for a pathway is set to the molecular similarity between its end-product molecule and the input molecule. As the reward is only applied to the outcome, instead of the reasoning steps, the model is endowed with the ability to explore diverse synthetic pathways.

Figure 3. Reinforcement learning (RL) finetuning of ReaSyn using GRPO

Goal-directed search: guiding pathways

During generation, ReaSyn uses beam search, which maintains a pool of sequences being generated and expands them block-by-block (BB or RXN). The search enables ReaSyn to generate diverse pathways for a single input molecule, and guides the generation in a preferred direction by scoring the sequences through a reward function. In retrosynthesis planning, the reward function can be the similarity to the input molecule. In goal-directed optimization tasks, the reward function can be the desired chemical property.

Figure 4. Goal-directed test-time search of ReaSyn

Generating synthetic pathways with ReaSyn

ReaSyn’s synthesizable projection is highly versatile: it enables retrosynthesis planning, suggests analogs for unsynthesizable molecules, supports goal-directed molecular optimization, and facilitates synthesizable hit expansion. Below, we examine its performance on these tasks.

Retrosynthesis planning

Table 1. Retrosynthesis planning success rates (%)

Even given a vast synthesizable space, previous synthesizable molecule generation methods have struggled to cover this space extensively. ReaSyn shows a high success rate in generating synthetic pathways given synthesizable molecules, demonstrating its powerful explorability in the synthesizable chemical space.

Synthesizable goal-directed molecular optimization

Table 2. Average synthesizable optimization scores of 15 PMO molecular optimization tasks

ReaSyn can project molecules generated by an off-the-shelf molecular optimization method to perform synthesizable goal-directed optimization. Combined with Graph GA, Graph GA-ReaSyn shows higher optimization performance than previous synthesis-based methods.

Synthesizable hit expansion: exploring molecule neighborhoods

The search scheme enables ReaSyn to suggest multiple synthesizable analogs for a given target molecule by projecting them differently. ReaSyn explores the neighborhood of given molecules in synthesizable space, and can be applied to hit expansion to find diverse synthesizable analogs of hit molecules (Figure 5).

Figure 5. Synthesizable hit expansion with ReaSyn

Empowering drug discovery with advanced reasoning

Most generative models create molecules that aren’t synthesizable in practice. ReaSyn builds on recent reasoning advances in LLMs, equipping scientists with an effective generative tool to project small molecules into the synthesizable chemical space. With its enhanced reasoning capabilities, diversity, and versatility, ReaSyn shows promise as a means for navigating combinatorially large synthesizable chemical space in real-world drug discovery.

Find out more about ReaSyn by reading our paper on arXiv and the code is available on GitHub.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

ReaSyn NVIDIA 分子合成 AI 药物发现 化学 生成模型 ReaSyn NVIDIA Molecular Synthesis AI Drug Discovery Chemistry Generative Models
相关文章