Menu
video thumbnail
Pause
Mute
Subtitles
Playback speed
0.25
0.5
0.75
1
1.25
1.5
1.75
2
Full screen

Online Learning in Non-Stationary Markov Decision Processes

Published on 2013-08-063062 Views

We consider online learning in Markov decision processes with adversarial reward functions. Depending on the information available to the decision maker, we analyze two scenarios: in one setup the

Related categories

Presentation

Online learning in non-stationry Markov decision processes00:00
The learning problem01:27
Results03:54
Except where otherwise noted, content on this site is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International license.