Robust Optimization for Hybrid MDPs with State-Dependent Noise

Zahra Zamani, Karina Valdivia Delgado


2019-11-11


Abstract

Recent advances in solutions to Hybrid MDPs with discrete and continuous state and action spaces have significantly extended the class of MDPs for which exact solutions can be derived, albeit at the expense of a restricted transition noise model. In this paper, we work around limitations of previous solutions by adopting a robust optimization approach in which Nature is allowed to adversarially determine transition noise within pre-specified confidence intervals. This allows one to derive an optimal policy with an arbitrary (user-specified) level of success probability and significantly extends the class of transition noise models for which Hybrid MDPs can be solved. This work also significantly extends results for the related “chance-constrained” approach in stochastic hybrid control to accommodate state-dependent noise. We demonstrate our approach working on a variety of hybrid MDPs taken from AI planning, operations research, and control theory, noting that this is the first time robust solutions with strong guarantees over all states have been automatically derived for such problems.
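
To make the robust (max-min) idea concrete, the following is a minimal numerical sketch of a single robust backup, not the method of the paper, which derives exact solutions rather than sampling. Everything named here is an assumption made for illustration: the one-dimensional state `x`, the additive dynamics `x + a + eps`, the functions `noise_interval` and `reward`, and the grid approximation of Nature's choice.

```python
def noise_interval(x):
    """Hypothetical state-dependent noise bounds: noise widens as |x| grows."""
    half_width = 0.1 + 0.05 * abs(x)
    return -half_width, half_width


def reward(x):
    """Hypothetical reward: staying close to the origin is preferred."""
    return -abs(x)


def robust_q(x, a, value, n_samples=21):
    """Reward at x plus the worst-case value of the next state x + a + eps.

    Nature picks eps inside the interval for state x; here its choice is
    approximated by a minimum over an evenly spaced grid (an assumption).
    """
    lo, hi = noise_interval(x)
    worst = min(
        value(x + a + lo + (hi - lo) * i / (n_samples - 1))
        for i in range(n_samples)
    )
    return reward(x) + worst


def robust_backup(x, actions, value):
    """Agent maximizes over actions against Nature's minimizing noise."""
    best_q, best_a = max((robust_q(x, a, value), a) for a in actions)
    return best_q, best_a


if __name__ == "__main__":
    # One backup from x = 1.0, using the reward itself as the value estimate.
    print(robust_backup(1.0, [-1.0, 0.0, 1.0], reward))
```

In this toy setting the agent moves toward the origin, and the returned value accounts for the least favorable noise allowed at the current state instead of an expected outcome.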


