DarkEQA: Benchmarking VLMs for Low-Light Embodied Question Answering
Analysis
Key Takeaways
- •Introduces DarkEQA, a new benchmark for evaluating VLMs in low-light embodied question answering.
- •Employs a physically-realistic simulation of low-light conditions.
- •Enables attributable robustness analysis by isolating the perception bottleneck.
- •Evaluates state-of-the-art VLMs and LLIE models, revealing their limitations.
“DarkEQA isolates the perception bottleneck by evaluating question answering from egocentric observations under controlled degradations, enabling attributable robustness analysis.”