Tecnologías de la Voz

El objetivo principal de este curso es proporcionar los conceptos y herramientas básicas utilizadas en el procesamiento de la señal de voz. Se presentan los conceptos básicos necesarios de acústica y se estudia con detalle los procesos de generacón y percepción del habla y las técnicas básicas de procesado digital de señal utilizadas en el análisis de la señal de voz.
Desde el curso 2006/07 este curso se oferta en inglés

Course Agenda 2008-2009

Part I: Fundamental Theory
    0. Fundamentals of Acoustics
    1. Sound and Noise
    2. Sound and Noise Level Measure
    3. Sound Propagation
    4. Harmonic Plane Waves
    5. Pipes and Cavities: Acoustic Circuits
    1. Speech production
    1. Anatomy and Physiology of the Speech Organs
    2. Acoustic Phonetics
    3. Acoustic Theory of Speech Production
    2. Hearing and Speech Perception
    1. Anatomy and Physiology of the Ear
    2. Sound Perception
    3. Speech Analysis and Representation
    1. Short-Time Speech Analysis
    2. Time-Domain Parameters
    3. Short-Time Fourier Analysis
    4. Linear Predictive Coding Analysis
    5. Cepstral Analysis
    6. Speech enhancement: Additive and Convolutive noise
    7. Perceptually motivated representations
    4. Speech and Audio Coding
    1. Speech Coders Attributes
    2. Frequency Domain Coders
    3. Analysis by Synthesis Coders

Part II: Speech Technologies

    5. Pattern Recognition
    1. Bayes´ Decision Theory
    2. Classifiers
    3. Feature Extraction and Selection
    6. Speech Recognition
    1. Why is it so difficult?
    2. A Basic Pattern Recognition Approach
    3. Statistical Speech Recognition
    4. Acoustic Modelling: Hidden Markov Models
    5. Language Modelling
    6. Basic Search Algorithms
    7. Voice Portal Development: VoiceXML

Part III: Laboratory Work