NVIDIA Blog 10月29日 22:47
利用NVIDIA Omniverse和Cosmos加速物理AI模型的合成数据生成
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

物理AI模型,如机器人和自动驾驶汽车,需要安全、泛化性强且能实时感知和操作的AI。与大型语言模型不同,它们需要基于真实世界的数据进行训练,但真实世界数据的收集既困难又危险。NVIDIA发布了NVIDIA Cosmos开放世界基础模型(WFMs)的更新,通过NVIDIA Omniverse库和Cosmos,开发者可以大规模生成物理基础的合成数据。Cosmos Predict 2.5集成了Text2World、Image2World和Video2World,能从单一输入生成一致、可控的多摄像头视频世界。Cosmos Transfer 2.5支持高保真、空间控制的世界风格迁移,可为模拟环境添加新的天气、光照和地形条件。这些WFMs可集成到NVIDIA Isaac Sim中,生成逼真的视频,缩小模拟与现实的差距。多家领先的机器人和AI公司如Skild AI和Serve Robotics已开始利用这些技术加速物理AI的开发。

🤖 **物理AI模型的挑战与合成数据的重要性**:物理AI模型,如驱动机器人和自动驾驶汽车的AI,必须在现实世界中安全、泛化且实时运行。与能从互联网海量数据训练的语言模型不同,物理AI依赖于真实世界数据,但收集这些数据成本高昂且充满风险。因此,基于物理的合成数据生成成为解决这一数据鸿沟的关键方法。

💡 **NVIDIA Cosmos与Omniverse驱动的合成数据生成**:NVIDIA更新了其NVIDIA Cosmos开放世界基础模型(WFMs),并结合NVIDIA Omniverse库,使开发者能够大规模生成物理基础的合成数据。Cosmos Predict 2.5通过整合Text2World、Image2World和Video2World,能从单一图像、视频或文本提示生成一致且可控的多摄像头视频世界。Cosmos Transfer 2.5则支持高保真的风格迁移,允许在模拟环境中添加多样化的天气、光照和地形条件,以增强数据的变异性。

🚀 **实际应用与行业赋能**:多家领先的机器人和AI公司,如Skild AI和Serve Robotics,已积极采用这些技术加速物理AI的开发。Skild AI利用Cosmos Transfer增强数据多样性,在NVIDIA Isaac Lab中训练机器人策略;Serve Robotics则利用NVIDIA Isaac Sim生成的合成数据训练其自动驾驶机器人,已成功完成大规模的最后一英里配送。这些案例表明,合成数据生成正在显著降低AI开发的时间和成本。

🛠️ **完整的合成数据生成流程**:NVIDIA提供了一个四步流程来生成合成数据:首先使用NVIDIA Omniverse NuRec库从智能手机数据重建OpenUSD数字孪生;其次,使用SimReady资产填充数字孪生;接着,在Isaac Sim中使用MobilityGen工作流生成合成数据;最后,利用NVIDIA Cosmos增强生成的数据。这一流程旨在提供一个端到端的解决方案,以支持物理AI模型的开发。

Editor’s note: This post is part of Into the Omniverse, a series focused on how developers, 3D practitioners and enterprises can transform their workflows using the latest advances in OpenUSD and NVIDIA Omniverse.

Physical AI models — which power robots, autonomous vehicles and other intelligent machines — must be safe, generalized for dynamic scenarios and capable of perceiving, reasoning and operating in real time. Unlike large language models that can be trained on massive datasets from the internet, physical AI models must learn from data grounded in the real world.

However, collecting sufficient data that covers this wide variety of scenarios in the real world is incredibly difficult and, in some cases, dangerous. Physically based synthetic data generation offers a key way to address this gap.

NVIDIA recently released updates to NVIDIA Cosmos open world foundation models (WFMs) to accelerate data generation for testing and validating physical AI models. Using NVIDIA Omniverse libraries and Cosmos, developers can generate physically based synthetic data at incredible scale.

Cosmos Predict 2.5 now unifies three separate models — Text2World, Image2World and Video2World — into a single lightweight architecture that generates consistent, controllable multicamera video worlds from a single image, video or prompt.

Cosmos Transfer 2.5 enables high-fidelity, spatially controlled world-to-world style transfer to amplify data variation. Developers can add new weather, lighting and terrain conditions to their simulated environments across multiple cameras. Cosmos Transfer 2.5 is 3.5x smaller than its predecessor, delivering faster performance with improved prompt alignment and physics accuracy.

These WFMs can be integrated into synthetic data pipelines running in the NVIDIA Isaac Sim open-source robotics simulation framework, built on the NVIDIA Omniverse platform, to generate photorealistic videos that reduce the simulation-to-real gap. Developers can reference a four-part pipeline for synthetic data generation:

From Simulation to the Real World

Leading robotics and AI companies are already using these technologies to accelerate physical AI development.

Skild AI, which builds general-purpose robot brains, is using Cosmos Transfer to augment existing data with new variations for testing and validating robotics policies trained in NVIDIA Isaac Lab.

Skild AI uses Isaac Lab to create scalable simulation environments where its robots can train across embodiments and applications. By combining Isaac Lab robotics simulation capabilities with Cosmos’ synthetic data generation, Skild AI can train robot brains across diverse conditions without the time and cost constraints of real-world data collection.

Serve Robotics uses synthetic data generated from thousands of simulated scenarios in NVIDIA Isaac Sim. The synthetic data is then used in conjunction with real data to train physical AI models. The company has built one of the largest autonomous robot fleets operating in public spaces and has completed over 100,000 last-mile meal deliveries across urban areas. Serve’s robots collect 1 million miles of data monthly, including nearly 170 billion image-lidar samples, which are used in simulation to further improve robot models.

Learn more about how Serve Robotics uses Isaac Sim to accelerate development, testing and deployment of its sidewalk delivery robots by watching the below livestream.

Beyond bringing people meals, Serve recently used its robots to deliver compute power — dropping off brand-new NVIDIA DGX Spark personal AI supercomputers to Refik Anadol, Will.I.AM and Ollama. With 1 petaflop of AI performance, DGX Spark offers developers desktop capabilities for workflows from AI model prototyping and model fine-tuning to inference and robotics development.

Autonomous drone delivery company Zipline also participated in the DGX Spark drop, with Chief Hardware Officer Jo Mardall receiving a DGX Spark by drone at the company’s headquarters and testing facility in Half Moon Bay, California. Zipline uses the NVIDIA Jetson edge AI and robotics platform for its drone delivery systems.

See How Developers Are Using Synthetic Data 

Lightwheel, a simulation-first robotics solution provider, is helping companies bridge the simulation-to-real gap with SimReady assets and large-scale synthetic datasets. With high-quality synthetic data and simulation environments built on OpenUSD, Lightwheel’s approach helps ensure robots trained in simulation perform effectively in real-world scenarios, from factory floors to homes.

Data scientist and Omniverse community member Santiago Villa is using synthetic data with Omniverse libraries and Blender software to improve mining operations by identifying large boulders that halt operations.

Undetected boulders entering crushers can cause delays of seven minutes or more per incident, costing mines up to $650,000 annually in lost production. Using Omniverse to generate thousands of automatically annotated synthetic images across varied lighting and weather conditions dramatically reduces training costs while enabling mining companies to improve boulder detection systems and avoid equipment downtime.

FS Studio partnered with a global logistics leader to improve AI-driven package detection by creating thousands of photorealistic package variations in different lighting conditions using Omniverse libraries like Replicator. The synthetic dataset dramatically improved object detection accuracy and reduced false positives, delivering measurable gains in throughput speed and system performance across the customer’s logistics network.

Images courtesy of FS Studio

Robots for Humanity built a full simulation environment in Isaac Sim for an oil and gas client using Omniverse libraries to generate synthetic data, including depth, segmentation and RGB images, while collecting joint and motion data from the Unitree G1 robot through teleoperation.

Image courtesy of Robots for Humanity.

Omniverse Ambassador Scott Dempsey is developing a synthetic data generation synthesizer that builds various cables from real-world manufacturer specifications, using Isaac Sim to generate synthetic data augmented with Cosmos Transfer to create photorealistic training datasets for applications that detect and handle cables.

Get Plugged Into the World of OpenUSD

Learn more about OpenUSD, Cosmos and synthetic data for physical AI by exploring these resources:

Stay up to date by subscribing to NVIDIA Omniverse news, joining the Omniverse community and following Omniverse on Discord, Instagram, LinkedIn, Threads, X and YouTube

Explore the Alliance for OpenUSD forum and the AOUSD website.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

NVIDIA Omniverse NVIDIA Cosmos 合成数据 物理AI 机器人 自动驾驶 Isaac Sim OpenUSD AI开发 NVIDIA Synthetic Data Physical AI Robotics Autonomous Driving AI Development
相关文章