EE5331
Speech Signal Processing
(1) The speech signal – process of speech production – acoustic phonetics – the speech chain – anatomy of the ear – sound perception – audttory models (2) Acoustic thery of speech production – lossless tube models – digital models for sampled speech signals (3) Time domain processing of speech signals: short-time energy, magnitude, zero-crossing rate, autocorrelation function, AMDF. (4) Frequency domain representations for speech signals: short-time Fourier analysis and its modifications (5) Cepstrum and homomorphic speech processing: short-time cepsturm and complex cepstrum – cepstrum analysis of all-pole models-cepstrum distance measures. (6) Linear predictiv analysis of speech: basics of LP analysis-computation of model gain-frequency domain interpretation-solution of the LP equations-prediction error signal-properties of the LP polynomial-alternative representations of the LP coefficients. (7) Applications: introduction to speech coding, recognition, and synthesis.
Lawrence Rabiner and Ronals Schafer, “Theory and Applications of Digital Speech Processing”, Prentice Hall, 2011
1. T.F. Quatieri, Discrete-Time Speech Signal Processing, Prentice Hall 2002.
2. L.T. Rabiner and R. Schafer, Digital Processing of Speech Signals, Prentice Hall, 1978.
3. Douglas O”Shaughnessy, Speech Communciations: Human and Machine, Universities Press, 2001.
4. J. Benetsy, M.M. Sondhi, Y. Huang (eds) Springer Handbook of Speech Processing, Springer-Verlag, 2008.
