Speech communication is an experimental science in which practical experience with the natural speech production and perception mechanisms, as well as with technical speech processing systems, their perceptual quality, and their usability, is a key component in the education of spoken language engineers. This laboratory introduces measurement methods and tools for speech transmission and speech perception (e.g., intelligibility vs. naturalness, mean-opinion-score testing) and for the assessment of speech coders and speech recognizers in clean and disturbed environments. Finally, it addresses the design and configuration of large-scale dialogue systems through a prototypical application development task.
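
As a small illustration of the mean-opinion-score testing mentioned above, the sketch below averages listener ratings on a 1 to 5 scale and attaches an approximate 95% confidence interval. It is only a minimal example under stated assumptions: the function name `mean_opinion_score`, the normal-approximation interval (factor 1.96), and the ratings are hypothetical and are not taken from the laboratory material.

```python
import math
import statistics


def mean_opinion_score(ratings):
    """Return (MOS, half-width of an approximate 95% confidence interval).

    Assumes ratings on the usual 1..5 opinion scale and uses a normal
    approximation for the interval, which is rough for small panels.
    """
    if len(ratings) < 2:
        raise ValueError("need at least two ratings to estimate the spread")
    for r in ratings:
        if not 1 <= r <= 5:
            raise ValueError(f"rating {r} is outside the 1..5 scale")

    mos = statistics.mean(ratings)
    # Sample standard deviation (n - 1 in the denominator).
    spread = statistics.stdev(ratings)
    half_width = 1.96 * spread / math.sqrt(len(ratings))
    return mos, half_width


# Hypothetical ratings from five listeners for one speech coder condition.
example_ratings = [4, 4, 5, 3, 4]
mos, ci = mean_opinion_score(example_ratings)
print(f"MOS = {mos:.2f} +/- {ci:.2f}")
```

In a real listening test the panel would be larger, and the interval would typically be reported per test condition so that coders can be compared.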

The course consists of 6 units of 4 hours each:
  • Speech signal analysis in the time, frequency, and time-frequency domains
  • Speech synthesis by time-domain concatenation and prosody modification
  • Backpropagation and extensions
  • Radial basis function networks
  • Speech coding
  • Hidden Markov models
  • Speech recognition using the Hidden Markov Model Toolkit (HTK)