OpenGround: 开放世界3D视觉定位

Research Paper #3D Visual Grounding, Zero-Shot Learning, Open-World Learning, Computer Vision, Artificial Intelligence 🔬 Research|分析: 2026年1月3日 19:20•

发布: 2025年12月28日 17:44

•

1分で読める

•ArXiv

分析

本文介绍了OpenGround，一个用于3D视觉定位的新框架，通过实现零样本学习和处理开放世界场景来解决现有方法的局限性。核心创新是基于主动认知的推理（ACR）模块，该模块动态扩展了模型的认知范围。本文的意义在于它能够处理未定义或未知的目标，使其适用于更多样化和更真实的3D场景理解任务。OpenTarget数据集的引入通过提供一个用于评估开放世界定位性能的基准，进一步促进了该领域的发展。

要点

引用 / 来源

查看原文

"The Active Cognition-based Reasoning (ACR) module performs human-like perception of the target via a cognitive task chain and actively reasons about contextually relevant objects, thereby extending VLM cognition through a dynamically updated OLT."

ArXiv2025年12月28日 17:44

* 根据版权法第32条进行合法引用。

较旧

With Great Context Comes Great Prediction Power: Classifying Objects via Geo-Semantic Scene Graphs

较新

Reliability Analysis of a 1-out-of-n Cold Standby Redundant System under the Generalized Lindley Distribution

OpenGround: 开放世界3D视觉定位

分析

要点

相关分析

SpaceTimePilot：时空控制的生成视频渲染

量子混沌哈密顿量演化下的随机性生成

GaMO：几何感知扩散用于稀疏视角3D重建

📬 获取AI新闻

按类别浏览

热门话题

📬 获取AI新闻

按类别浏览

热门话题