Modeling Speech with Sum- Product Networks: Application to Bandwidth Extension

Research output: Chapter in Book/Report/Conference proceedingConference paperpeer-review

Abstract

Sum-product networks (SPNs) are a recently proposed type of probabilistic graphical models allowing complex variable interactions while still granting efficient inference. In this paper we demonstrate the suitability of SPNs for modeling log-spectra of speech signals using the application of artificial bandwidth extension, i.e. artificially replacing the high-frequency content which is lost in telephone signals. We use SPNs as observation models in hidden Markov models (HMMs), which model the temporal evolution of log short-time spectra. Missing frequency bins are replaced by the SPNs using most-probable-explanation inference, where the state-dependent reconstructions are weighted with the HMM state posterior. According to subjective listening and objective evaluation, our system consistently and significantly improves the state of the art.
Original languageEnglish
Title of host publication2024 IEEE International Conference on Acoustics, Speech, and Signal Processing
Pages3699-3703
ISBN (Electronic)978-1-4799-2893-4
DOIs
Publication statusPublished - 2014
Event2014 IEEE International Conference on Acoustics, Speech, and Signal Processing: ICASSP 2014 - Florence, Italy
Duration: 4 May 20149 May 2014

Conference

Conference2014 IEEE International Conference on Acoustics, Speech, and Signal Processing
Country/TerritoryItaly
CityFlorence
Period4/05/149/05/14

Fields of Expertise

  • Information, Communication & Computing

Fingerprint

Dive into the research topics of 'Modeling Speech with Sum- Product Networks: Application to Bandwidth Extension'. Together they form a unique fingerprint.

Cite this