Nvidia Developer 9小时前
OpenFold3:AI驱动的蛋白质结构预测新里程碑
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

OpenFold3标志着AI在蛋白质结构预测领域迈入新阶段,它将开放科学与企业级性能相结合,实现了对多链复合物、核酸和小分子配体的预测。得益于NVIDIA cuEquivariance、MMseqs2-GPU和NVIDIA FLARE等技术,OpenFold3在速度、规模和隐私保护方面取得了显著提升,为生物制药和生物技术团队提供了前所未有的强大工具。该模型已集成至NVIDIA NIM,并提供加速支持,用户可通过简单步骤部署和使用,加速药物研发和分子科学研究的进程。

💡 **OpenFold3扩展了AI蛋白质结构预测的能力范围**:该模型不仅能预测单个蛋白质的结构,还能处理更复杂的生物分子系统,包括多链蛋白质复合物、蛋白质与核酸(DNA/RNA)的相互作用,以及包含小分子配体的组装体。这使得AI能够更全面地模拟生命体的分子交互机制,为理解和设计生物系统提供更强大的工具。

🚀 **NVIDIA加速技术赋能OpenFold3实现高性能**:OpenFold3集成了多项NVIDIA技术,如cuEquivariance用于高效的3D对称性操作加速,MMseqs2-GPU用于快速的序列比对,以及NVIDIA FLARE支持隐私保护的联邦学习。这些优化显著提升了预测速度和处理大规模数据的能力,使得OpenFold3能够满足企业级应用的需求,并能在H100 Tensor Core GPU上实现秒级预测。

🤝 **开放科学与企业级协作的典范**:OpenFold3由OpenFold联盟开发,汇聚了多家生物制药公司的力量,遵循Class 1开放模型标准,确保了模型的透明度和可复现性。通过NVIDIA FLARE,不同机构可以在不共享敏感数据的情况下进行模型训练和优化,促进了全球范围内的安全、高效的合作,加速了新药研发和生物技术的创新。

🔧 **易于部署和使用的NVIDIA NIM服务**:OpenFold3已作为NVIDIA NIM(NVIDIA NeMo Inference Microservice)提供,用户可以轻松地将其部署在本地、集群或云环境中。通过简单的API调用,即可提交结构预测任务,并快速获得包含3D坐标和置信度指标(如pLDDT, pTM, ipTM)的预测结果,极大地简化了研究人员的工作流程。

​​For decades, one of biology’s deepest mysteries was how a string of amino acids folds itself into the intricate architecture of life. Researchers built painstaking simulations and statistical models, inching toward an answer but never crossing the threshold of prediction at scale. 

Then, deep learning changed everything. By learning the language of evolution directly from sequence data, AI began to uncover the hidden rules of molecular form, transforming structure prediction from an art into an engineering discipline.

Today, that transformation reaches a new milestone. OpenFold3 brings production-ready protein AI into the NVIDIA ecosystem, uniting open science with enterprise-grade performance. Developed by the OpenFold Consortium and accelerated by NVIDIA, OpenFold3 extends structure prediction beyond single proteins to model multi-chain complexes, nucleic acids, and small-molecule ligands—the complete grammar of biological interaction. 

With NVIDIA cuEquivariance for symmetry-aware GPU acceleration, compatibility with MMseqs2-GPU for rapid sequence search, and NVIDIA FLARE for federated training, OpenFold3 delivers unprecedented speed, scale, and privacy-preserving collaboration for biopharma and biotech teams worldwide. 

OpenFold3 is now available and, as an NVIDIA NIM, with additional acceleration. This post walks you through how to use the OpenFold3 NIM for your structure prediction work. 

Prerequisites

Structure prediction with the OpenFold3 NIM

With OpenFold3 NIM, structure prediction can move from prototype to production in just a few steps, as detailed below.

Step 1: Access the model

OpenFold3 NIM is available through build.nvidia.com. You can deploy the container locally, on a cluster, or as a managed NIM service.

docker pull nvcr.io/nim/openfold/openfold3:latestexport LOCAL_NIM_CACHE=~/.cache/nimexport NGC_API_KEY=<Your NGC API Key>docker run --rm --name openfold3 \    --runtime=nvidia \    --gpus 'device=0' \    -p 8000:8000 \    -e NGC_API_KEY \    -v $LOCAL_NIM_CACHE:/opt/nim/.cache \    --shm-size=16g \    nvcr.io/nim/openfold/openfold3:latest

Step 2: Submit a structure prediction job

Once deployed, you can interact with the API using standard REST calls or Python clients:

#!/usr/bin/env python3import requestsimport osimport jsonfrom pathlib import Path# Define output file and inference endpointoutput_file = "output.json"url = "http://localhost:8000/biology/openfold/openfold3/predict"# Define protein sequenceprotein_sequence = "MGREEPLNHVEAERQRREKLNQRFYALRAVVPNVSKMDKASLLGDAIAYINELKSKVVKTESEKLQIKNQLEEVKLELAGRLEHHHHHH"# Define MSA alignment in CSV formatmsa_alignment_csv = "key,sequence\n-1,MGREEPLNHVEAERQRREKLNQRFYALRAVVPNVSKMDKASLLGDAIAYINELKSKVVKTESEKLQIKNQLEEVKLELAGRLEHHHHHH"# Define DNA sequences (complementary pair)dna_sequence_b = "AGGAACACGTGACCC"dna_sequence_c = "TGGGTCACGTGTTCC"# Build request datadata = {    "request_id": "5GNJ",    "inputs": [        {            "input_id": "5GNJ",            "molecules": [                {                    "type": "protein",                    "id": "A",                    "sequence": protein_sequence,                    "msa": {                        "main_db": {                            "csv": {                                "alignment": msa_alignment_csv,                                "format": "csv",                            }                        }                    }                },                {                    "type": "dna",                    "id": "B",                    "sequence": dna_sequence_b                },                {                    "type": "dna",                    "id": "C",                    "sequence": dna_sequence_c                }            ],            "output_format": "pdb"        }    ]}r = requests.post(url=url, json=data)# Save the json outputprint(r, "Saving to output.json:\n", r.text[:200], "...")Path(output_file).write_text(r.text)

Predictions include 3D coordinates (PDB/mmCIF) and confidence metrics such as pLDDT, pTM, and ipTM, all delivered in seconds on NVIDIA H100 Tensor Core GPUs.

A new open standard for protein structure prediction

The OpenFold Consortium, an industry-led coalition including Bayer, Bristol Myers Squibb, Johnson & Johnson, Novo Nordisk, Outpace Bio, and others, has been instrumental in advancing open, reproducible modeling systems.

OpenFold3 represents the consortium’s most significant milestone yet. The model extends structure prediction to multimers, protein–DNA/RNA complexes, and ligand-bound assemblies, achieving accuracy that meets or exceeds leading open-source models. 

Notably, OpenFold3 reaches parity with AlphaFold3 performance on protein–nucleic acid benchmarks, an area where earlier models have traditionally lagged. It is also classified as a Class 1 open-source system under the Linux Foundation open model definitions, ensuring full transparency and reproducibility.

Open science meets enterprise reliability

OpenFold3 is optimized for the NVIDIA accelerated AI computing stack, including:

    cuEquivariance: Physics-aware acceleration for 3D symmetry operations.MMseqs2-GPU: compatible with this GPU-native multiple sequence alignment tool.NVIDIA FLARE: Compatible with federated learning for cross-institutional fine-tuning without data sharing.

Together, these integrations make OpenFold3 NIM both developer-accessible and enterprise-deployable—a drop-in service for on-prem, hybrid, and cloud environments. NVIDIA TensorRT enables up to 1.8x faster inference for large multimers and nucleic acid complexes.

OpenFold3 has been validated in secure federated workflows by Apheris and SandboxAQ, proving its ability to scale across global pharma R&D environments. Federated pipelines enable partners to fine-tune on proprietary data, such as antibody–antigen complexes or RNA–ligand assemblies, without moving datasets across institutional boundaries.

And because OpenFold3 is a Class 1 open system according to the Linux Foundation open model definitions, the software and consortium benefit from a rapidly growing ecosystem of contributors and benchmarks. This ensures continuous improvement and long-term reliability.

With NVIDIA FLARE integration, organizations can train OpenFold3 collaboratively across multiple sites, such as pharma partners, research consortia, and hospitals, without sharing sensitive data.

This approach supports regulatory compliance (GDPR and HIPAA, for example) while unlocking improvements to models from diverse datasets that would otherwise remain siloed.

Building the future of open protein AI

OpenFold3 is more than a model. It’s a foundation for the next decade of protein AI. It reflects the convergence of more than 40 institutions at OpenFold Consortium, open source science, accelerated computing, and federated collaboration, ensuring that the tools used by global researchers can also meet enterprise reliability and security standards.

Acknowledgments

Special thanks to the OpenFold Consortium and partners, including SandboxAQ and Apheris, for their collaboration in advancing open, accelerated AI for molecular science.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

OpenFold3 蛋白质结构预测 AI 深度学习 NVIDIA 生物技术 药物研发 开放科学 Protein Structure Prediction Artificial Intelligence Deep Learning Biotechnology Drug Discovery Open Science
相关文章