Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon

Ben Hambly,Renyuan Xu,Huining Yang

Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon

2021

Ben Hambly
Renyuan Xu
Huining Yang

We explore reinforcement learning methods for finding the optimal policy in the linear quadratic regulator (LQR) problem. In particular we consider the convergence of policy gradient methods in the...

Keywords:

Mathematics
Stochastic control
Applied mathematics
Convergence (routing)
Linear-quadratic regulator
Reinforcement learning
finite horizon

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations