Search:
Match:
1 results

Analysis

This paper introduces MUSON, a new multimodal dataset designed to improve socially compliant navigation in urban environments. The dataset addresses limitations in existing datasets by providing explicit reasoning supervision and a balanced action space. This is important because it allows for the development of AI models that can make safer and more interpretable decisions in complex social situations. The structured Chain-of-Thought annotation is a key contribution, enabling models to learn the reasoning process behind navigation decisions. The benchmarking results demonstrate the effectiveness of MUSON as a benchmark.
Reference

MUSON adopts a structured five-step Chain-of-Thought annotation consisting of perception, prediction, reasoning, action, and explanation, with explicit modeling of static physical constraints and a rationally balanced discrete action space.