Online Learning in Non-Stationary Markov Decision Processes

We consider online learning in Markov decision processes with adversarial reward functions. Depending on the information available to the decision maker, we analyze two scenarios: in one setup the

RELATED CATEGORIES

ON-LINE LEARNING

Online Learning in Non-Stationary Markov Decision Processes

Gergely Neu

RELATED CATEGORIES

MORE VIDEOS FROM THE EVENT

MORE VIDEOS FROM THE SAME CATEGORIES