Analysis

This paper presents a single-index bandit algorithm that addresses the curse of dimensionality in contextual bandits. It develops a non-asymptotic theory, proves minimax optimality, and studies adaptivity to unknown smoothness levels. The work is significant because high-dimensional bandit problems are common in real-world applications such as recommendation systems, and the algorithm offers a practical approach to them that also adapts when the smoothness level is not known in advance.
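
For orientation, a single-index model is usually written as below. This is the generic textbook form, stated here as an assumption: the summary does not give the paper's exact reward model, and the symbols $\theta$, $f$, $X$, and $Y$ are introduced only for illustration.

$$
\mathbb{E}[\,Y \mid X = x\,] = f(\theta^\top x), \qquad x \in \mathbb{R}^d,\quad \theta \in \mathbb{R}^d,\quad \|\theta\|_2 = 1,
$$

where $f : \mathbb{R} \to \mathbb{R}$ is an unknown link function. Under this form the context affects the expected reward only through the scalar index $\theta^\top x$, so the nonparametric part of the problem is one-dimensional whatever the value of $d$. That is the usual intuition for why a regret rate need not degrade with the ambient dimension.
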
Reference

The algorithm achieves minimax-optimal regret independent of the ambient dimension $d$, thereby overcoming the curse of dimensionality.