Menu

Regret Bounds for the Adaptive Control of Linear Quadratic Systems

calendar icon Aug 2, 2011 3824 views
split view icon
video icon
presentation icon
video with chapters icon
video thumbnail
Pause
Mute
speed icon
speed icon
0.25
0.5
0.75
1
1.25
1.5
1.75
2

We study the average cost Linear Quadratic (LQ) problem with unknown model parameters, also known as the adaptive control problem in the control community. We design an algorithm and prove that its regret up to time T is O(√T) apart from logarithmic factors. Unlike many classical approaches that use a forced-exploration scheme to provide the suffi cient exploratory information for parameter estimation, we construct a high-probability con fidence set around the model parameters and design an algorithms that plays optimistically with respect to this con fidence set. The construction of the con fidence set is based on the new results from online least-squares estimation and leads to improved worst-case regret bound for the proposed algorithm. To best of our knowledge this is the the fi rst time that a regret bound is derived for the LQ problem.

RELATED CATEGORIES

MORE VIDEOS FROM THE SAME CATEGORIES

Except where otherwise noted, content on this site is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International license.