Abstract
While the audio recordings of a corpus represent the ground truth, transcriptions are – in the case of manual annotations – subject to human error, and subject to changes related to technology improvements underpinning automated annotation methods. In order to facilitate the dynamic extension of speech corpora, we introduce Speechcake, a tool for centralized version control for speech corpora, enabling the automatic check-in and merging of annotations. It considers typical workflows of phoneticians, linguists and speech technologists, and enables the development of dynamic, collaborative, and perpetually-improving speech corpora.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 20th Conference on Natural Language Processing (KONVENS 2024) |
| Publisher | Association for Computational Linguistics (ACL) |
| Pages | 303-308 |
| Publication status | Published - 2024 |
| Event | 20th Conference on Natural Language Processing, KONVENS 2024 - Vienna, Austria Duration: 10 Sept 2024 → 13 Sept 2024 |
Conference
| Conference | 20th Conference on Natural Language Processing, KONVENS 2024 |
|---|---|
| Country/Territory | Austria |
| City | Vienna |
| Period | 10/09/24 → 13/09/24 |
Fingerprint
Dive into the research topics of 'Version Control for Speech Corpora'. Together they form a unique fingerprint.Projects
- 2 Finished
-
FWF - Spontansprache - Cross-layer language models for conversational speech
Schuppler, B. (Consortium manager resp. coordinator with external organisations) & Schuppler, B. (Project manager on research unit)
1/11/19 → 31/10/24
Project: Research project
-
FWF - CLCS_2 - Cross-layer prosodic models for conversational speech
Schuppler, B. (Project manager on research unit)
1/10/18 → 30/11/21
Project: Research project
Prizes
Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS