Pei
Menu
Reinforcement Learning
Posts
Resources
Cacti-based Framework
Publications
Email Address:
[email protected]
[email protected]
Blog Stats
136,017 hits
State Action/Control
blogs.cuit.columbia.edu/p
Meta
Log in
Entries feed
Comments feed
WordPress.org
Derivative of Sigmoid Function
Sigmoid Function
Derivative of Sigmoid Function
Last posts
Symbolic Netlist to Innovus-friendly Netlist
Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms
Solving H-horizon, Stationary Markov Decision Problems In Time Proportional To Log(H)
Randomized Linear Programming Solves the Discounted Markov Decision Problem In Nearly-Linear (Sometimes Sublinear) Run Time
KL Divergence
The Asymptotic Convergence-Rate of Q-learning
Hierarchical Apprenticeship Learning, with Application to Quadruped Locomotion
Policy Gradient Methods
Actor-Critic Algorithms for Hierarchical Markov Decision Processes
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation
Sidebar