BADGER: Learning to (Learn [Learning Algorithms] through Multi-Agent Communication)

Marek Rosa,Olga Afanasjeva,Simon Andersson,Joseph Davidson,Nicholas Guttenberg,Petr Hlubucek,Martin Poliak,Jaroslav Vítků,Jan Feyereisl

BADGER: Learning to (Learn [Learning Algorithms] through Multi-Agent Communication)

2019

Marek Rosa
Olga Afanasjeva
Simon Andersson
Joseph Davidson
Nicholas Guttenberg
Petr Hlubucek
Martin Poliak
Jaroslav Vítků
Jan Feyereisl

In this work, we propose a novel memory-based multi-agent meta-learning architecture and learning procedure that allows for learning of a shared communication policy that enables the emergence of rapid adaptation to new and unseen environments by learning to learn learning algorithms through communication. Behavior, adaptation and learning to adapt emerges from the interactions of homogeneous experts inside a single agent. The proposed architecture should allow for generalization beyond the level seen in existing methods, in part due to the use of a single policy shared by all experts within the agent as well as the inherent modularity of 'Badger'.

Keywords:

Badger
Architecture
Computer science
Modularity
learning to learn
Algorithm
Homogeneous
single agent

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations