Dr. Pei

Electrical and Computer Engineering | STCO

RL Archives - Page 3 of 3 - Dr. Pei

Policy Gradient Methods for Reinforcement Learning with Function Approximation

April 22, 2026

Policy Gradient Methods for Reinforcement Learning with Function Approximation Math Analysis Markov Decision Processes and Policy Gradient So far in this book almost all the methods have been action-value methods; they learned the values of actions and then selected actions based on their estimated action values; their policies would not even exist without the… read more »

Metric spaces

July 16, 2024

Metric spaces 度量空间及相关的一些知识点 Definition 6.1.5 (Convergence of sequences). Let ε > 0 be a real number, and let L be a real number. A sequence of real numbers is said to be ε-close to L iff an is ε-close to L for every n ≥ N, i.e., we have |an − L| ≤ ε… read more »

Reinforcement Learning with Soft State Aggregation

April 22, 2026

Reinforcement Learning with Soft State Aggregation Math Analysis – Present A New Approach Based On Bayes’ Theorem: Apply Clustering π Rather than State Lookup Table for Computing Q Value Problem Definition and Summary of Notation We consider the problem of solving large Markovian decision processes (MDPs) using RL algorithms and compact function approximation. The objective is to maximize… read more »

« Newer Entries

Sidebar

Google Scholar

Dr. Pei

Email Address:

Blog Stats

State Action/Control

Meta

RL Archives - Page 3 of 3 - Dr. Pei

Policy Gradient Methods for Reinforcement Learning with Function Approximation

Metric spaces

Reinforcement Learning with Soft State Aggregation