Abstract
We study the problem of controlling linear time- invariant systems with known noisy dynamics and adversarially chosen quadratic losses. We present the first efficient online learning algorithms in this setting that guarantee O(Vf) regret under mild assumptions, where T is the time horizon. Our algorithms rely on a novel SDP relaxation for the steady-state distribution of the system. Crucially, and in contrast to previously proposed relaxations, the feasible solutions of our SDP all correspond to "strongly stable" policies that mix exponentially fast to a steady state.
| Original language | English |
|---|---|
| Title of host publication | 35th International Conference on Machine Learning, ICML 2018 |
| Editors | Andreas Krause, Jennifer Dy |
| Publisher | International Machine Learning Society (IMLS) |
| Pages | 1667-1681 |
| Number of pages | 15 |
| ISBN (Electronic) | 9781510867963 |
| State | Published - 2018 |
| Event | 35th International Conference on Machine Learning, ICML 2018 - Stockholm, Sweden Duration: 10 Jul 2018 → 15 Jul 2018 |
Publication series
| Name | 35th International Conference on Machine Learning, ICML 2018 |
|---|---|
| Volume | 3 |
Conference
| Conference | 35th International Conference on Machine Learning, ICML 2018 |
|---|---|
| Country/Territory | Sweden |
| City | Stockholm |
| Period | 10/07/18 → 15/07/18 |
Bibliographical note
Publisher Copyright:© 2018 35th International Conference on Machine Learning, ICML 2018. All rights reserved.
Fingerprint
Dive into the research topics of 'Online Linear Quadratic Control'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver