Acoustic COVID-19 Detection Using Multiple Instance Learning

Research output: Contribution to journal › Article › peer-review

Abstract

During the COVID-19 pandemic, a rigorous testing scheme was crucial. However, tests can be time-consuming and expensive. A machine learning-based diagnostic tool for audio recordings could enable widespread testing at low cost. To make such algorithms comparable, the DiCOVA challenge was created. It is based on the Coswara dataset, which offers the recording categories cough, speech, breath and vowel phonation. Recording durations vary greatly, ranging from one second to over a minute. In our approach, a base model is first pre-trained on random, short time intervals. Subsequently, a Multiple Instance Learning (MIL) model based on self-attention is incorporated to make collective predictions for multiple time segments within each audio recording, taking advantage of longer durations. To compete in the fusion category of the DiCOVA challenge, we use a linear regression approach, among other fusion methods, to combine predictions from the most successful models associated with each sound modality. The application of the MIL approach significantly improves generalizability, leading to an AUC ROC score of 86.6% in the fusion category. By incorporating previously unused data, including the sound modality 'sustained vowel phonation' and patient metadata, we significantly improved on our previous results, reaching a score of 92.2%.
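
The idea of making one collective prediction from several time segments of a recording can be illustrated with a minimal sketch. It is not the model from the article: it assumes that each segment has already been turned into an embedding vector by some base model, uses a single self-attention head with identity projections (no learned query, key or value matrices), averages the attended segments, and applies a logistic classifier. The names `self_attention`, `bag_probability` and `classifier_weights` are hypothetical and chosen only for this illustration.

```python
import math
from typing import List

Vector = List[float]


def softmax(scores: Vector) -> Vector:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(segments: List[Vector]) -> List[Vector]:
    """Scaled dot-product self-attention across segment embeddings.

    Assumption: queries, keys and values are the embeddings themselves
    (identity projections); a trained model would learn these projections.
    """
    dim = len(segments[0])
    scale = math.sqrt(dim)
    attended = []
    for query in segments:
        # Similarity of this segment to every segment in the recording.
        scores = [sum(q * k for q, k in zip(query, key)) / scale
                  for key in segments]
        weights = softmax(scores)
        # Weighted average of all segment embeddings.
        attended.append([sum(w * seg[j] for w, seg in zip(weights, segments))
                         for j in range(dim)])
    return attended


def bag_probability(segments: List[Vector],
                    classifier_weights: Vector,
                    bias: float = 0.0) -> float:
    """One probability per recording (the 'bag'), whatever its segment count."""
    attended = self_attention(segments)
    count = len(attended)
    dim = len(attended[0])
    # Mean-pool the attended segments into one recording-level embedding.
    bag = [sum(row[j] for row in attended) / count for j in range(dim)]
    logit = sum(w * x for w, x in zip(classifier_weights, bag)) + bias
    return 1.0 / (1.0 + math.exp(-logit))


if __name__ == "__main__":
    short_recording = [[0.2, -0.1]]                      # one segment
    long_recording = [[0.2, -0.1], [0.9, 0.4], [0.1, 0.0]]  # three segments
    weights = [1.5, -0.5]                                # hypothetical values
    print(bag_probability(short_recording, weights))
    print(bag_probability(long_recording, weights))
```

Because the pooling step averages over however many segments are present, the same function handles a one-second recording and a minute-long one, which is the property that lets a bag-level model take advantage of longer durations.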

Original language: English
Pages (from-to): 620-630
Number of pages: 11
Journal: IEEE Journal of Biomedical and Health Informatics
Volume: 29
Issue number: 1
Early online date: 4 Oct 2024
Publication status: Published - 2025

Keywords

  • Audio-based Infection Prediction
  • Coswara
  • COVID-19
  • Crowdsourced Datasets
  • DiCOVA
  • Multiple Instance Learning

ASJC Scopus subject areas

  • Computer Science Applications
  • Health Informatics
  • Electrical and Electronic Engineering
  • Health Information Management
