Robust Risk-Sensitive RL with Bayesian DP
Analysis
Key Takeaways
- •Proposes a novel RSRL framework robust to transition uncertainty.
- •Unifies and generalizes existing RL frameworks.
- •Develops a Bayesian DP algorithm with strong consistency guarantees.
- •Demonstrates advantages in risk-sensitivity and robustness.
- •Validates the approach through numerical experiments, including option hedging.
“The Bayesian DP algorithm alternates between posterior updates and value iteration, employing an estimator for the risk-based Bellman operator that combines Monte Carlo sampling with convex optimization.”