RGBT-Ground: A New Benchmark for Robust Visual Grounding in Real-World Scenarios
Analysis
Key Takeaways
- Introduces RGBT-Ground, a new benchmark for visual grounding in complex real-world scenarios.
- Utilizes RGB and Thermal Infrared (TIR) image pairs for more robust evaluation.
- Provides a unified visual grounding framework and a baseline model (RGBT-VGNet).
- Addresses limitations of existing benchmarks in terms of scene diversity and real-world conditions.
“RGBT-Ground, the first large-scale visual grounding benchmark built for complex real-world scenarios.”