Efficient Reinforcement Learning for High Dimensional Linear Quadratic Systems

资源分类

2020-01-16 |

101 |

78 |

Abstract

We study the problem of adaptive control of a high dimensional linear quadratic (LQ) system. Previous work established the asymptotic convergence to an optimal controller for various adaptive control schemes. More recently, for the average cost LQ problem, a regret bound of 图片.png was shown, apart form logarithmic factors. However, this bound scales exponentially with p, the dimension of the state space. In this work we consider the case where the matrices describing the dynamic of the LQ system are sparse and their dimensions are large. We present an adaptive control scheme that achieves a regret bound of 图片.png , apart from logarithmic factors. In particular, our algorithm has an average cost of times the optimum cost after T = polylogThis is in comparison to previous work on the dense dynamics where the algorithm requires time that scales exponentially with dimension in order to achieve regret of times the optimal cost. We believe that our result has prominent applications in the emerging area of computational advertising, in particular targeted online advertising and advertising in social networks.

上一篇：Bayesian Hierarchical Reinforcement Learning

下一篇：Neurally Plausible Reinforcement Learning of Working Memory Tasks

用户评价

全部评价

还没有评论，说两句吧！

热门资源

A Mathematical Mo...

Direct democracy, where each voter casts one vo...
Learning to Predi...

Much of model-based reinforcement learning invo...
Joint Pose and Ex...

Facial expression recognition (FER) is a challe...
Bounding the Inef...

Social networks on the Internet have seen an en...
The Variational S...

Unlike traditional images which do not offer in...

智能在线

400-630-6780
聆听.建议反馈

E-mail: support@tusaishared.com