SemEval-2022 Task 9: R2VQ Competence-based Multimodal Question Answering

Jingxuan Tu, Eben Holderness, Marco Maru, Simone Conia, Kyeongmin Rim, Kelley Lynch, Richard Brutti, Roberto Navigli, James Pustejovsky

2022

In this task, we identify a challenge that reflects the linguistic and cognitive competencies humans draw on when speaking and reasoning. In particular, given the intuition that textual and visual information mutually inform each other in semantic reasoning, we formulate a Competence-based Question Answering challenge designed to involve rich semantic annotation and aligned text-video objects. The task is to answer questions from a collection of cooking recipes and videos, where each question belongs to a question family reflecting a specific reasoning competence. The data and task results are publicly available.
