Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with √T Regret.

Asaf Benjamin Cassel, Tomer Koren

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Original languageEnglish
Title of host publicationProceedings of the 38th International Conference on Machine Learning
EditorsMarina Meila, Tong Zhang
Pages1304-1313
Number of pages10
StatePublished - 2021
Event38th International Conference on Machine Learning, ICML 2021 - Virtual
Duration: 18 Jul 202124 Jul 2021
Conference number: 38

Publication series

NameProceedings of Machine Learning Research
PublisherPMLR
Volume139

Conference

Conference38th International Conference on Machine Learning, ICML 2021
Abbreviated titleICML 2021
Period18/07/2124/07/21

Cite this