
Analysis

This paper addresses state ambiguity in robot manipulation, a common problem in which identical observations can correspond to multiple valid behaviors. The proposed solution, PAM (Policy with Adaptive working Memory), handles long history windows while avoiding the computational burden and overfitting of naive methods. Its key innovations are two-stage training, hierarchical feature extraction, context routing, and a reconstruction objective. PAM also keeps inference speed above 20Hz, which is crucial for real-world robotic applications. Evaluation across seven tasks demonstrates that PAM handles state ambiguity effectively.
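
To make the two numbers in this summary concrete, the sketch below shows what a 300-frame history window and a 20Hz inference rate imply in practice. It is not PAM's implementation: the `HistoryWindow` class and `step_budget_ms` helper are hypothetical names introduced here for illustration, and only the window length and rate come from the summary. A naive policy would process every stored frame at each step, which is the cost PAM is described as avoiding.

```python
from collections import deque

HISTORY_FRAMES = 300   # window length quoted in the summary
MIN_RATE_HZ = 20.0     # inference-rate lower bound quoted in the summary


class HistoryWindow:
    """Hypothetical fixed-length buffer holding the most recent observations."""

    def __init__(self, maxlen=HISTORY_FRAMES):
        self._frames = deque(maxlen=maxlen)

    def push(self, observation):
        # Once the buffer is full, deque discards the oldest frame automatically.
        self._frames.append(observation)

    def frames(self):
        # Oldest frame first, newest frame last.
        return list(self._frames)

    def __len__(self):
        return len(self._frames)


def step_budget_ms(rate_hz=MIN_RATE_HZ):
    """Longest time one inference step may take to sustain rate_hz."""
    return 1000.0 / rate_hz


# Feed 450 dummy observations; only the latest 300 are retained.
window = HistoryWindow()
for t in range(450):
    window.push(t)
```

Under these assumptions, staying above 20Hz leaves at most 50 ms per inference step, regardless of how many frames the window holds.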

Reference

PAM supports a 300-frame history window while maintaining high inference speed (above 20Hz).