Dr. Pei
Electrical and Computer Engineering | STCO
Menu
Home
SPCsim
RL
Rankings
Posts
Cacti++
Bible
Email Address:
[email protected]
[email protected]
Blog Stats
1,874 hits
State Action/Control
blogs.cuit.columbia.edu/zp2130
Meta
Log in
Entries feed
Comments feed
WordPress.org
Compute Backpropagation Derivatives
Consider two features x
1
, x
2
for a single training set:
Last posts
Symbolic Netlist to Innovus-friendly Netlist
i
Probability
Reinforcement Learning is Direct Adaptive Optimal Control
Decentralized Stabilization for a Class of Continuous-Time Nonlinear Interconnected Systems Using Online Learning Optimal Control Approach
Hierarchical Policy Gradient Algorithms
Ark and Park
Hierarchical Actor-Critic
RL Other Useful Reference
Policy Gradient and Q-learning
Sidebar
×
Learn
more