HAT: Adaptive Spatio-Temporal Alignment for 3D Perception
Analysis
Key Takeaways
- •Proposes HAT, a novel spatio-temporal alignment module for 3D perception.
- •HAT uses multiple motion models and multi-hypothesis decoding for optimal alignment.
- •Achieves state-of-the-art tracking results and improves perception accuracy in E2E AD.
- •Demonstrates robustness under corrupted semantic conditions.
“HAT consistently improves 3D temporal detectors and trackers across diverse baselines. It achieves state-of-the-art tracking results with 46.0% AMOTA on the test set when paired with the DETR3D detector.”