Experience Replay Q(λ)-learning with Leader-Following Control for Multi-Evader Pursuit Evasion Games

Zhe-Yang Zhu, Cheng-Lin Liu


2021


This paper addresses a pursuit-evasion game with multiple evaders in which only some of the pursuers, termed leading pursuers, can access the evaders' positions during the pursuit. Q(λ)-learning is used to train the pursuers. Because Q(λ)-learning converges slowly, a new method that combines experience replay Q(λ)-learning with dynamic target assignment is introduced. Simulations show that the proposed method achieves better convergence than Q(λ)-learning in the multi-evader pursuit-evasion game.
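For readers unfamiliar with the two ingredients named above, the following is a minimal tabular sketch, not the authors' implementation. It assumes Watkins's variant of Q(λ) with accumulating eligibility traces, and it assumes that replay reuses stored transitions with plain one-step Q-learning updates; the abstract does not specify either choice. The function names, the state and action indexing, and the values of `alpha`, `gamma`, and `lam` are all hypothetical, and the leader-following control and dynamic target assignment parts are not modeled.

```python
import random
from collections import deque

import numpy as np


def q_lambda_step(Q, E, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9, lam=0.8):
    """One Watkins's Q(lambda) update; Q and E are modified in place."""
    greedy_next = int(np.argmax(Q[s_next]))
    # a_next counts as greedy if it ties with the maximizing action.
    next_is_greedy = Q[s_next, a_next] == Q[s_next, greedy_next]
    # TD error toward the greedy value of the next state.
    delta = r + gamma * Q[s_next, greedy_next] - Q[s, a]
    E[s, a] += 1.0              # accumulating eligibility trace
    Q += alpha * delta * E      # credit all recently visited pairs
    if next_is_greedy:
        E *= gamma * lam        # decay traces along the greedy path
    else:
        E[:] = 0.0              # cut traces after an exploratory action
    return delta


def replay(Q, buffer, batch_size=4, alpha=0.1, gamma=0.9, rng=None):
    """Assumed replay scheme: one-step Q-learning on sampled transitions."""
    rng = rng or random.Random(0)
    batch = rng.sample(list(buffer), min(batch_size, len(buffer)))
    for s, a, r, s_next in batch:
        target = r + gamma * np.max(Q[s_next])
        Q[s, a] += alpha * (target - Q[s, a])


# Tiny usage example with 3 states and 2 actions (hypothetical sizes).
Q = np.zeros((3, 2))
E = np.zeros_like(Q)
buffer = deque(maxlen=100)
q_lambda_step(Q, E, s=0, a=1, r=1.0, s_next=1, a_next=0)
buffer.append((0, 1, 1.0, 1))
replay(Q, buffer)
```

In this sketch the replay pass revisits old transitions without touching the traces, so each stored experience contributes to more than one update, which is the usual motivation for replay when convergence is slow.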

Keywords:

Mathematical optimization
slow convergence
Pursuit-evasion
leader following
control
Convergence (routing)
Computer science
