Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot.

Joel Z. Leibo, Edgar A. Duéñez-Guzmán, Alexander Vezhnevets, John P. Agapiou, Peter Sunehag, Raphaël Koster, Jayd Matyas, Charles Beattie, Igor Mordatch, Thore Graepel

2021

Existing evaluation suites for multi-agent reinforcement learning (MARL) do not assess generalization to novel situations as their primary objective (unlike supervised-learning benchmarks). Our contribution, Melting Pot, is a MARL evaluation suite that fills this gap, and uses reinforcement learning to reduce the human labor required to create novel test scenarios. This works because one agent's behavior constitutes (part of) another agent's environment. To demonstrate scalability, we have created over 80 unique test scenarios covering a broad range of research topics such as social dilemmas, reciprocity, resource sharing, and task partitioning. We apply these test scenarios to standard MARL training algorithms, and demonstrate how Melting Pot reveals weaknesses not apparent from training performance alone.
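The claim that one agent's behavior forms part of another agent's environment can be illustrated with a toy sketch. The code below is not the Melting Pot API: the payoff matrix, the policy functions, and the scenario names are all hypothetical, chosen only to show how pairing the same focal policy with different frozen background policies yields different test scenarios without hand-designing a new environment for each.

```python
from typing import Callable, Dict, Optional

# Hypothetical toy substrate: an iterated prisoner's dilemma.
# Actions are "C" (cooperate) or "D" (defect); payoff values are illustrative.
PAYOFF = {
    ("C", "C"): (3, 3),
    ("C", "D"): (0, 5),
    ("D", "C"): (5, 0),
    ("D", "D"): (1, 1),
}

# A policy maps the co-player's previous action (None on the first step)
# to the next action.
Policy = Callable[[Optional[str]], str]


def always_cooperate(prev: Optional[str]) -> str:
    return "C"


def always_defect(prev: Optional[str]) -> str:
    return "D"


def tit_for_tat(prev: Optional[str]) -> str:
    # Cooperate first, then copy the co-player's last action.
    return "C" if prev is None else prev


def focal_return(focal: Policy, background: Policy, steps: int = 10) -> int:
    """Total reward of the focal policy when paired with a frozen background policy."""
    focal_prev: Optional[str] = None
    background_prev: Optional[str] = None
    total = 0
    for _ in range(steps):
        a_focal = focal(background_prev)
        a_background = background(focal_prev)
        total += PAYOFF[(a_focal, a_background)][0]
        focal_prev, background_prev = a_focal, a_background
    return total


# Each "scenario" is the same game plus a different frozen co-player.
# In this sketch the co-players are hand-written stand-ins for trained agents.
SCENARIOS: Dict[str, Policy] = {
    "cooperative_partner": always_cooperate,
    "exploitative_partner": always_defect,
    "reciprocating_partner": tit_for_tat,
}


def evaluate(focal: Policy, steps: int = 10) -> Dict[str, int]:
    """Score one focal policy across every scenario."""
    return {
        name: focal_return(focal, background, steps)
        for name, background in SCENARIOS.items()
    }


if __name__ == "__main__":
    for name, score in evaluate(always_defect).items():
        print(f"{name}: {score}")
```

In this sketch a focal policy that always defects scores well against an unconditional cooperator but poorly against a reciprocating partner, which is the kind of weakness a single training-time score would not expose.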

Keywords:

Scenario testing
Computer science
Scalability
Reinforcement learning
Reciprocity
Suite
Generalization
Shared resource
Task
Artificial intelligence
Machine learning
