Abstract
Sum-product networks (SPNs) are a recently proposed type of probabilistic graphical models allowing complex variable interactions while still granting efficient inference. In this paper we demonstrate the suitability of SPNs for modeling log-spectra of speech signals using the application of artificial bandwidth extension, i.e. artificially replacing the high-frequency content which is lost in telephone signals. We use SPNs as observation models in hidden Markov models (HMMs), which model the temporal evolution of log short-time spectra. Missing frequency bins are replaced by the SPNs using most-probable-explanation inference, where the state-dependent reconstructions are weighted with the HMM state posterior. According to subjective listening and objective evaluation, our system consistently and significantly improves the state of the art.
| Original language | English |
|---|---|
| Title of host publication | 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing |
| Pages | 3699-3703 |
| ISBN (Electronic) | 978-1-4799-2893-4 |
| DOIs | |
| Publication status | Published - 2014 |
| Event | 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing: ICASSP 2014 - Florence, Italy Duration: 4 May 2014 → 9 May 2014 |
Conference
| Conference | 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing |
|---|---|
| Country/Territory | Italy |
| City | Florence |
| Period | 4/05/14 → 9/05/14 |
Fields of Expertise
- Information, Communication & Computing
Fingerprint
Dive into the research topics of 'Modeling Speech with Sum- Product Networks: Application to Bandwidth Extension'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS