Handling Anomalies of Synthetic Questions in Unsupervised Question Answering

Giwon Hong,Junmo Kang,Doyeon Lim,Sung-Hyon Myaeng

Handling Anomalies of Synthetic Questions in Unsupervised Question Answering

2020

Giwon Hong
Junmo Kang
Doyeon Lim
Sung-Hyon Myaeng

Advances in Question Answering (QA) research require additional datasets for new domains, languages, and types of questions, as well as for performance increases. Human creation of a QA dataset like SQuAD, however, is expensive. As an alternative, an unsupervised QA approach has been proposed so that QA training data can be generated automatically. However, the performance of unsupervised QA is much lower than that of supervised QA models. We identify two anomalies in the automatically generated questions and propose how they can be mitigated. We show our approach helps improve unsupervised QA significantly across a number of QA tasks.

Keywords:

Question answering
Artificial intelligence
Natural language processing
Computer science
Training set

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations