Search:
Match:
1 results

Unified Embodied VLM Reasoning for Robotic Action

Published:Dec 30, 2025 10:18
1 min read
ArXiv

Analysis

This paper addresses the challenge of creating general-purpose robotic systems by focusing on the interplay between reasoning and precise action execution. It introduces a new benchmark (ERIQ) to evaluate embodied reasoning and proposes a novel action tokenizer (FACT) to bridge the gap between reasoning and execution. The work's significance lies in its attempt to decouple and quantitatively assess the bottlenecks in Vision-Language-Action (VLA) models, offering a principled framework for improving robotic manipulation.
Reference

The paper introduces Embodied Reasoning Intelligence Quotient (ERIQ), a large-scale embodied reasoning benchmark in robotic manipulation, and FACT, a flow-matching-based action tokenizer.