Computer Audition for Fighting the SARS-CoV-2 Corona Crisis – Introducing the Multi-task Speech Corpus for COVID-19

2021 
Computer audition (CA) has experienced a fast development in the past decades by leveraging advanced signal processing and machine learning techniques. In particular, for its non-invasive and ubiquitous character by nature, CA based applications in healthcare have increasingly attracted attention in recent years. During the tough time of the global crisis caused by the COVID-19 (coronavirus disease 2019), scientists and engineers in data science have collaborated to think of novel ways in prevention, diagnosis, treatment, tracking, and management of this global pandemic. On one hand, we have witnessed the power of 5G, internet of things, big data, computer vision, and artificial intelligence in applications of epidemiology modelling, drug and/or vaccine finding and designing, fast CT screening, and quarantine management. On the other hand, relevant studies in exploring the capacity of CA are extremely lacking and underestimated. To this end, we propose a novel multi-task speech corpus for COVID-19 research usage. We collected 51 confirmed COVID-19 patients’ in-the-wild speech data in Wuhan city, China. We define three main tasks in this corpus, i. e., three-category classification tasks for evaluating the physical and/or mental status of patients, i. e., sleep quality, fatigue, and anxiety. The benchmarks are given by using both classic machine learning methods and state-of-the-art deep learning techniques. We believe this study and corpus cannot only facilitate the ongoing research on using data science to fight against COVID-19, but also the monitoring of contagious diseases for general purpose.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    91
    References
    3
    Citations
    NaN
    KQI
    []