Domain Adversarial Reinforcement Learning.

Bonnie Li, Vincent François-Lavet, Thang Doan, Joelle Pineau

2021

We consider the problem of generalization in reinforcement learning when visual aspects of the observations differ across environments, e.g. different backgrounds or changes in contrast or brightness. We assume that the agent has access to only a few MDPs from the MDP distribution during training. Its performance is then reported on new, unknown test domains drawn from that distribution (e.g. unseen backgrounds). For this "zero-shot RL" task, we enforce invariance of the learned representations to visual domains via a domain adversarial optimization process. We empirically show that this approach significantly improves generalization to new, unseen domains.
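
As a rough illustration of what a domain adversarial optimization step can look like, the sketch below uses the familiar gradient-reversal idea: a domain classifier is trained to predict the visual domain from the learned feature, while the encoder is updated in the opposite direction so that the feature carries less domain information. This is a toy under stated assumptions, not the method of the paper: the scalar encoder `w_enc`, the logistic domain classifier `w_dom`, the weight `lam`, and both function names are hypothetical, and the RL loss that would also drive the encoder is omitted.

```python
import math


def sigmoid(t):
    return 1.0 / (1.0 + math.exp(-t))


def domain_loss_and_grads(w_enc, w_dom, x, d):
    """Cross-entropy of a logistic domain classifier on the feature z = w_enc * x.

    x is a scalar observation and d in {0, 1} is its domain label
    (both hypothetical stand-ins for images and background identities).
    """
    z = w_enc * x                      # learned representation
    p = sigmoid(w_dom * z)             # predicted probability of domain 1
    loss = -(d * math.log(p) + (1 - d) * math.log(1 - p))
    dlogit = p - d                     # dL/d(logit) for sigmoid + cross-entropy
    grad_dom = dlogit * z              # dL/dw_dom
    grad_enc = dlogit * w_dom * x      # dL/dw_enc
    return loss, grad_enc, grad_dom


def adversarial_step(w_enc, w_dom, x, d, lr=0.1, lam=1.0):
    """One update: the classifier descends the domain loss, the encoder ascends it."""
    loss, grad_enc, grad_dom = domain_loss_and_grads(w_enc, w_dom, x, d)
    new_w_dom = w_dom - lr * grad_dom          # classifier gets better at telling domains apart
    new_w_enc = w_enc + lr * lam * grad_enc    # reversed gradient: encoder hides the domain
    return new_w_enc, new_w_dom, loss


if __name__ == "__main__":
    w_enc, w_dom = 0.5, 1.0
    for _ in range(3):
        w_enc, w_dom, loss = adversarial_step(w_enc, w_dom, x=2.0, d=1)
        print(f"domain loss={loss:.3f}  w_enc={w_enc:.3f}  w_dom={w_dom:.3f}")
```

In a full agent the encoder update would combine this reversed domain gradient with the gradient of the RL objective, with `lam` trading off task performance against invariance.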

Keywords:

Domain
Adversarial system
Reinforcement learning
Brightness
Computer science
Invariant (physics)
Artificial intelligence
