Reinforcement Learning in Decentralized Stochastic Control Systems with Partial History Sharing
In this paper, we are interested in systems with multiple agents that wish to cooperate in order to accomplish a common task while a) agents have different information (decentralized information) and b) agents do not know the complete model of the system i.e., they may only know the partial model or may not know the model at all. The agents must learn the optimal strategies by interacting with their environment i.e., by multi-agent Reinforcement Learning (RL). The presence of multiple agents with different information makes multi-agent (decentralized) reinforcement learning conceptually more difficult than single-agent (centralized) reinforcement learning. We propose a novel multi-agent reinforcement learning algorithm that learns epsilon-team-optimal solution for systems with partial history sharing information structure, which encompasses a large class of multi-agent systems including delayed sharing, control sharing, mean field sharing, etc. Our approach consists of two main steps as follows: 1) the multiagent (decentralized) system is converted to an equivalent single-agent (centralized) POMDP (Partial Observable Markov Decision Process) using the common information approach of Nayyar et al, TAC 2013, and 2) based on the obtained POMDP, an approximate RL algorithm is constructed using a novel methodology. We show that the performance of the RL strategy converges to the optimal performance exponentially fast. We illustrate the proposed approach and verify it numerically by obtaining a multi-agent Q-learning algorithm for two-user Multi Access Broadcast Channel (MABC) which is a benchmark example for multi-agent systems.
RELATED CATEGORIES
MORE VIDEOS FROM THE EVENT
Utility-weighted sampling in decisions from experience
Falk Lieder
Jul 28, 2015 2165 views
Natural RLDM: Optimal and Subptimal Control in Brain and Behavior
Nathaniel Daw
Jul 28, 2015 4867 views
Quickly Learning to Make Good Decisions
Emma Brunskill
Jul 28, 2015 4209 views
Basics of Computational Reinforcement Learning
Michael Littman
Jul 28, 2015 23554 views
Deep Reinforcement Learning
David Silver
Jul 28, 2015 124514 views
MORE VIDEOS FROM THE SAME CATEGORIES
3. Bio-inspired Flying Robots
Sabine Hauert
Jun 2, 2008 7608 views
Learning to Behave by Reading
Regina Barzilay
Aug 17, 2012 5981 views
Usage of the Kalman filter for data cleaning of sensor data
Klemen Kenda
Oct 30, 2013 3618 views
Tropical Cyclone Event Sequence Similarity Search via Dimensionality Reduction a...
Shen-Shyang Ho
Oct 1, 2010 3357 views
16. CogSketch: Open-Domain Sketch Understanding for Research and Education
Andrew Lovett,
Penny Yin,
Jon Wetzel,
Jeffrey Usher,
Kate Lockwood,
Kenneth Forbus
Jun 20, 2009 5507 views