# Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

- Solving H-horizon, Stationary Markov Decision Problems In Time Proportional To Log(H)
- Randomized Linear Programming Solves the Discounted Markov Decision Problem In Nearly-Linear (Sometimes Sublinear) Run Time
- KL Divergence
- The Asymptotic Convergence-Rate of Q-learning
- Hierarchical Apprenticeship Learning, with Application to Quadruped Locomotion
- Policy Gradient Methods
- Actor-Critic Algorithms for Hierarchical Markov Decision Processes
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation
- Meta Learning Shared Hierarchies