logo

ChainThink

Stay ahead, master crypto insights

NVIDIA Open-Sources Lyra 2.0, Turning a Single Photo into a Walkable 3D World, Directly Importable into Robot Simulators

NVIDIA Open-Sources Lyra 2.0, Turning a Single Photo into a Walkable 3D World, Directly Importable into Robot Simulators

2026-04-16 08:31

View Original

ChainThink News, April 16: According to Beating's monitoring, NVIDIA has released the open-source Lyra 2.0 framework, which enables the generation of exploratory 3D worlds from a single image. After users upload a photo, Lyra 2.0 first generates a walkthrough video controlled by camera trajectory, then reconstructs the video into a 3D Gaussian Splatting (Gaussian Splats) and mesh model, directly importable into game engines and simulators for real-time rendering.


The model weights and code are released under the Apache 2.0 license on Hugging Face and GitHub, permitting commercial use. Its core technical breakthrough lies in addressing two degradation issues in long-distance walkthroughs: first, "spatial forgetting," where Lyra 2.0 maintains 3D geometric information per frame, resolving inconsistencies in scene depth when the camera retraces its path; second, "temporal drift," mitigated through self-enhanced training that enables the model to correct errors, preventing cumulative frame-by-frame inaccuracies that distort the scene. The framework is built upon Wan 2.1-14B diffusion Transformer as its underlying architecture, with an output resolution of 832×480.


One of Lyra 2.0’s primary application scenarios is robotic simulation. In NVIDIA’s demonstration, the generated 3D scenes were imported into its proprietary physics simulator, Isaac Sim, enabling robots to perform navigation and interaction tasks. Previously, a major bottleneck in embodied intelligence training was the high cost and limited variety of 3D environment creation—Lyra 2.0 provides a scalable pipeline for batch-generating training environments from photographs. Compared to Lyra 1.0, released in September last year, Lyra 2.0 extends the generation scope to continuous long-range exploration. While Google’s earlier Genie 3 demonstrated similar capabilities, it remained non-open-source. Lyra 2.0 is currently the most comprehensive open-source solution in this domain.

Disclaimer: Contains third-party opinions, does not constitute financial advice

Recommended Reading
The total liquidation value across the entire network in the past 24 hours reached $185 million, with both long and short positions being liquidated.
The total liquidation value across the entire network in the past 24 hours reached $185 million, with both long and short positions being liquidated.
Iranian officials set to depart Pakistan, holding no meetings with U.S. side throughout the journey
Iranian officials set to depart Pakistan, holding no meetings with U.S. side throughout the journey
Aave Joins Multiple Parties in Submitting Proposal to Arbitrum DAO to Unlock Frozen ETH and Restore rsETH Support
Aave Joins Multiple Parties in Submitting Proposal to Arbitrum DAO to Unlock Frozen ETH and Restore rsETH Support
Day 6 of the rsETH incident: DeFi United secures approximately $100 million in committed funding intentions, yet a shortfall of $50 million remains.
Day 6 of the rsETH incident: DeFi United secures approximately $100 million in committed funding intentions, yet a shortfall of $50 million remains.
Iranian MP: Iran Has Formed an Integrated Strategy for Managing the Strait of Hormuz
Iranian MP: Iran Has Formed an Integrated Strategy for Managing the Strait of Hormuz
DeepSeek Plans to Raise $1.8 Billion in Funding, Valued at Approximately $20 Billion
DeepSeek Plans to Raise $1.8 Billion in Funding, Valued at Approximately $20 Billion
Sources: Iran's stance has become firmer than during the first round of negotiations
Sources: Iran's stance has become firmer than during the first round of negotiations