Uploaded videos
23:29
A Worst-Case Comparison Between Temporal Difference and Residual Gradient with L...
Sep 4, 2019 176 views
21:04
The Online Discovery Problem and Its Application to Lifelong Reinforcement Learn...
Jul 28, 2015 2571 views
21:49
Unbiased Offline Evaluation of Contextual-bandit-based News Article Recommendati...
Aug 9, 2011 3820 views
28:19
Knows What It Knows: A Framework For Self-Aware Learning
Jul 24, 2008 7699 views
