Dr. Pei

Electrical and Computer Engineering | STCO

Stationary Distribution Archives - Dr. Pei

Policy Gradient Methods for Reinforcement Learning with Function Approximation

April 22, 2026

Policy Gradient Methods for Reinforcement Learning with Function Approximation Math Analysis Markov Decision Processes and Policy Gradient So far in this book almost all the methods have been action-value methods; they learned the values of actions and then selected actions based on their estimated action values; their policies would not even exist without the… read more »

Sidebar

Google Scholar

Dr. Pei

Email Address:

Blog Stats

State Action/Control

Meta

Stationary Distribution Archives - Dr. Pei

Policy Gradient Methods for Reinforcement Learning with Function Approximation