Menu
Workshop on Principled Methods of Trading Exploration and Exploitation London, 2005

Workshop on Principled Methods of Trading Exploration and Exploitation London, 2005

9 Videos · Jul 5, 2005

About

Traditional off-line learning methods are often not appropriate for applications in user modelling and user interfaces since to be useful the system must learn about the user or context during the process of interaction 'on the fly'. This immediately raises the fundamental problem of trading off exploration and exploitation in that as information is learnt the system may be tempted to act in line with this insight rather than further exploring alternatives. Machine learning has developed a number of models that attempt to capture and analyse this trade-off, from the simplest bandit problem to the full Markov decision processes underlying reinforcement learning.

The workshop includes tutorials covering the bandit analysis as well as its relevance to user modelling. Reinforcement learning would also be included with particular emphasis on applications in user interfaces. It is also hoped to launch a challenge in this area. This workshop comes under the Thematic Programme 4: Online User Modelling and Reinforcement Learning and is a core meeting of the PASCAL Network.

Videos

Lectures

video-img
58:38

Research Problems in applaying RL in Interactive Systems Towards a Taxonomy...

Chris Watkins

calendar icon Feb 25, 2007 3054 views

video-img
48:29

Overview of Results Pump Priming Project

Samy Bengio

calendar icon Feb 25, 2007 2968 views

video-img
58:03

Clustering from an Optimization viewpoint Exploration and Exploitation using Upp...

Moses Charikar

calendar icon Feb 25, 2007 4304 views

video-img
01:14:41

Models for Trading Exploration and Exploitation using Upper Confidence Bounds

Peter Auer

calendar icon Feb 25, 2007 3687 views

video-img
45:37

Presentation of proposed outline challenge

Jason McFall

calendar icon Feb 25, 2007 3746 views

video-img
57:28

The exploration and exploitation tradeoff: Strategy learning and queries

Colin de la Higuera

calendar icon Feb 25, 2007 3714 views

video-img
12:06

Gradient-Based Estimates of Return Distributions

Christos Dimitrakakis

calendar icon Feb 25, 2007 2601 views

video-img
01:03:37

mSpace

Monica Schraefel

calendar icon Feb 25, 2007 3129 views

video-img
01:28:36

Multiarmed Bandits and Partial Monitoring Exploration and Exploitation using Upp...

Nicolò Cesa-Bianchi

calendar icon Feb 25, 2007 3976 views

Except where otherwise noted, content on this site is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International license.