Active correction for speaker diarization with human in the loop

Yevhenii Prokopalo,Meysam Shamsi,Loïc Barrault,Sylvain Meignier,Anthony Larcher

Active correction for speaker diarization with human in the loop

2021

Yevhenii Prokopalo
Meysam Shamsi
Loïc Barrault
Sylvain Meignier
Anthony Larcher

State of the art diarization systems now achieve decent performance but those performances are often not good enough to deploy them without any human supervision. In this paper we propose a framework that solicits a human in the loop to correct the clustering by answering simple questions. After defining the nature of the questions, we propose an algorithm to list those questions and two stopping criteria that are necessary to limit the work load on the human in the loop. Experiments performed on the ALLIES dataset show that a limited interaction with a human expert can lead to considerable improvement of up to 36.5% relative diarization error rate (DER) compared to a strong baseline.

Keywords:

Human-in-the-loop
Computer science
Speaker diarisation
Speech recognition

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations