热点
关于我们
xx
xx
"
3D场景理解
" 相关文章
CORE-3D: Context-aware Open-vocabulary Retrieval by Embeddings in 3D
cs.AI updates on arXiv.org
2025-09-30T04:07:00.000000Z
Vid-LLM: A Compact Video-based 3D Multimodal LLM with Reconstruction-Reasoning Synergy
cs.AI updates on arXiv.org
2025-09-30T04:06:45.000000Z
Text-Scene: A Scene-to-Language Parsing Framework for 3D Scene Understanding
cs.AI updates on arXiv.org
2025-09-23T05:47:06.000000Z
ColonCrafter: A Depth Estimation Model for Colonoscopy Videos Using Diffusion Priors
cs.AI updates on arXiv.org
2025-09-18T04:36:27.000000Z
Argus: Leveraging Multiview Images for Improved 3-D Scene Understanding With Large Language Models
cs.AI updates on arXiv.org
2025-07-18T04:14:06.000000Z
2025.04.10 | DDT提升图像生成质量;GenDoP优化相机轨迹生成。
HuggingFace 每日AI论文速递
2025-04-10T23:02:38.000000Z