Pei
Derivative of Sigmoid Function
Sigmoid Function
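For reference, a minimal statement of the standard logistic sigmoid, which is assumed here to be the function this post refers to. The symbol names $\sigma$ and $x$ are chosen for illustration and are not taken from the original.

```latex
\[
  \sigma(x) = \frac{1}{1 + e^{-x}}, \qquad x \in \mathbb{R},
\]
\[
  0 < \sigma(x) < 1, \qquad \sigma(0) = \tfrac{1}{2}, \qquad \sigma(-x) = 1 - \sigma(x).
\]
```

The function maps any real input into the open interval (0, 1), which is why it is commonly used to turn a score into a probability-like value.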
Derivative of Sigmoid Function
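A short sketch of the usual derivation, assuming the standard logistic sigmoid $\sigma(x) = 1/(1 + e^{-x})$; the notation is illustrative and not taken from the original. It applies the chain rule to $(1 + e^{-x})^{-1}$ and then rewrites the result in terms of $\sigma$ itself.

```latex
\begin{align*}
  \sigma(x)  &= \left(1 + e^{-x}\right)^{-1} \\
  \sigma'(x) &= -\left(1 + e^{-x}\right)^{-2} \cdot \left(-e^{-x}\right)
              = \frac{e^{-x}}{\left(1 + e^{-x}\right)^{2}} \\
             &= \frac{1}{1 + e^{-x}} \cdot \frac{e^{-x}}{1 + e^{-x}} \\
             &= \sigma(x)\,\bigl(1 - \sigma(x)\bigr),
  \qquad \text{since } 1 - \sigma(x) = \frac{e^{-x}}{1 + e^{-x}}.
\end{align*}
```

Because the derivative depends only on the value of $\sigma(x)$, it can be computed from the forward output without evaluating the exponential again. Its maximum is $\sigma'(0) = \tfrac{1}{4}$, and it tends to 0 as $|x|$ grows.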