Robust Speaker Verification in Air Traffic Control using Improved Voice Activity Detection

Research output: Chapter in Book/Report/Conference proceedingConference paperpeer-review

Abstract

In this paper, a robust speaker verification system using improved voice activity detection has been designed for increasing safety of air traffic control. In addition to the usage of the aircraft identification tag to assign speaker turns on the shared communication channel to aircrafts,
speaker verification is investigated as an add-on attribute to improve security level effectively for the air traffic control. The front-end processing unit is optimized to deal with small bandwidth restrictions and very short speaker turns. Two adaptive voice activity detection methods based on energy and wavelet parameters are developed and used as pre-processing in front-end unit. The verification task is accomplished by training background models and speaker dependent models. To enhance the robustness of the verification system, a cross verification unit is further applied.
The designed system is tested with SPEECHDAT-AT and WSJ0 database to demonstrate its superior performance.
Original languageEnglish
Title of host publicationProceedings of the Fourth IASTED International Conference Signal Processing, Pattern Recognition, and Applications
Place of PublicationAnaheim, Ca.
PublisherActa Press
Pages298-303
ISBN (Print)978-0-88986-646-1
Publication statusPublished - 2007
Event4th IASTED International Conference Signal Processing, Pattern Recognition, and Applications: SPPR 2007 - Innsbruck, Austria
Duration: 14 Feb 200716 Feb 2007

Conference

Conference4th IASTED International Conference Signal Processing, Pattern Recognition, and Applications
Abbreviated titleSPPR'07
Country/TerritoryAustria
CityInnsbruck
Period14/02/0716/02/07

Treatment code (Nähere Zuordnung)

  • Basic - Fundamental (Grundlagenforschung)
  • Application
  • Popular Scientific

Fingerprint

Dive into the research topics of 'Robust Speaker Verification in Air Traffic Control using Improved Voice Activity Detection'. Together they form a unique fingerprint.

Cite this