Multi-attending Memory Network for Modeling Multi-turn Dialogue

2019 
Modeling and reasoning over the dialogue history is a central challenge in building a good multi-turn conversational agent. End-to-end memory networks with recurrent or gated architectures have shown promise for conversation modeling. However, they still suffer from relatively low computational efficiency due to their complex architectures, and they rely on costly strong supervision or fixed prior knowledge. This paper proposes a multi-head-attention-based end-to-end approach, called the multi-attending memory network, which can effectively model and reason over multi-turn dialogue history without additional information or knowledge. Specifically, a parallel multi-head attention mechanism models the conversational context by attending to different important sections of the full dialogue. A stacked architecture with shortcut connections then reasons over the memory (the result of context modeling). Experiments on the bAbI-dialog datasets demonstrate the effectiveness of the proposed approach.
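The mechanism the abstract describes can be sketched as follows: a multi-head attention read over an encoded dialogue memory, repeated over several stacked hops with shortcut (residual) connections. This is a minimal illustrative sketch, not the paper's implementation; the function names, head-splitting scheme, and hop count are assumptions for illustration.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax over attention scores.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def multi_head_attend(query, memory, num_heads):
    """One parallel multi-head attention read over the dialogue memory.

    query:  (d,)   current state vector.
    memory: (T, d) encoded representations of the T history turns.
    The feature dimension is split into `num_heads` sub-spaces so each
    head can attend to a different section of the dialogue (an
    illustrative choice; the paper's exact parameterization may differ).
    """
    d = query.shape[0]
    assert d % num_heads == 0, "feature dim must divide evenly across heads"
    dh = d // num_heads
    outputs = []
    for h in range(num_heads):
        q = query[h * dh:(h + 1) * dh]        # (dh,) head's query slice
        m = memory[:, h * dh:(h + 1) * dh]    # (T, dh) head's memory slice
        scores = m @ q / np.sqrt(dh)          # scaled dot-product scores
        weights = softmax(scores)             # attention over history turns
        outputs.append(weights @ m)           # (dh,) head's read vector
    return np.concatenate(outputs)            # (d,) concatenated heads

def reason_hops(query, memory, num_heads=4, hops=3):
    """Stacked reasoning with shortcut connections: each hop reads the
    memory and adds the result back to the state (residual update)."""
    state = query
    for _ in range(hops):
        state = state + multi_head_attend(state, memory, num_heads)
    return state
```

Under this sketch, each hop refines the query state using what the previous hop retrieved, and the shortcut addition lets information from earlier hops flow directly to the output.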