We present a reinforcement learning architecture, Dyna-2, that encompasses both sample-based learning and sample-based search, and that generalises across states during both learning and search. We ap
MORE VIDEOS FROM THE EVENT
MORE VIDEOS FROM THE SAME CATEGORIES
Except where otherwise noted, content on this site is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International license.