Relevance Judgments for Image Retrieval Evaluation

Jayashree Kalpathy-Cramer, Steven Bedrick, William R. Hersh

2010

In this chapter, we review our experiences with the relevance judging process at ImageCLEF, using the medical retrieval task as a primary example. We begin with a historical perspective on the Cranfield paradigm, the precursor to most modern retrieval evaluation campaigns, since most modern system-based evaluation campaigns, including ImageCLEF, are modeled after it. We then briefly describe the stages of an evaluation campaign and provide details on the different aspects of the relevance judgment process. We summarize the recruitment process for judges and describe the various systems used for judgment at ImageCLEF. We discuss the advantages and limitations of creating pools that are then judged by human experts. Finally, we discuss our experiences with the subjectivity of relevance judgments and the relative robustness of performance measures to variability in relevance judging.
