3D Scene Understanding_Fishai

Articles related to "3D Scene Understanding"

CORE-3D: Context-aware Open-vocabulary Retrieval by Embeddings in 3D

cs.AI updates on arXiv.org 2025-09-30T04:07:00.000000Z

Vid-LLM: A Compact Video-based 3D Multimodal LLM with Reconstruction-Reasoning Synergy

cs.AI updates on arXiv.org 2025-09-30T04:06:45.000000Z

Text-Scene: A Scene-to-Language Parsing Framework for 3D Scene Understanding

cs.AI updates on arXiv.org 2025-09-23T05:47:06.000000Z

ColonCrafter: A Depth Estimation Model for Colonoscopy Videos Using Diffusion Priors

cs.AI updates on arXiv.org 2025-09-18T04:36:27.000000Z

Argus: Leveraging Multiview Images for Improved 3-D Scene Understanding With Large Language Models

cs.AI updates on arXiv.org 2025-07-18T04:14:06.000000Z

2025.04.10 | DDT improves image generation quality; GenDoP optimizes camera trajectory generation.

HuggingFace Daily AI Paper Digest 2025-04-10T23:02:38.000000Z
