Lihong Li

A Worst-Case Comparison Between Temporal Difference and Residual Gradient with L...

Lihong Li

Sep 4, 2019 176 views

The Online Discovery Problem and Its Application to Lifelong Reinforcement Learn...

Lihong Li

Jul 28, 2015 2575 views

Unbiased Offline Evaluation of Contextual-bandit-based News Article Recommendati...

Lihong Li

Aug 9, 2011 3823 views

Knows What It Knows: A Framework For Self-Aware Learning

Lihong Li

Jul 24, 2008 7699 views
