Pei
echo_$DISPLAY
echo $DISPLAY
shvcut03:29.0
gvim ~/.display
setenv DISPLAY shvcut03:29