View-on-Graph：基于视觉-语言推理的零样本3D视觉定位，基于场景图

Research #3D Vision 🔬 Research|分析: 2026年1月10日 12:27•

发布: 2025年12月10日 00:59

•

1分で読める

分析

该论文可能提出了一种新的3D视觉定位方法，允许模型在没有事先针对特定对象-场景对进行训练的情况下，在3D空间中定位对象。这种基于场景图上的视觉-语言推理的零样本能力是该领域的一项重大进展。

引用 / 来源

"The core of the research involves zero-shot 3D visual grounding."

ArXiv2025年12月10日 00:59

* 根据版权法第32条进行合法引用。

CORE: Enhancing LLMs with a Conceptual Reasoning Layer

Conflict-Aware Framework for LLM Alignment Tackles Misalignment Issues