Authors
Vivek Tyagi, Christian Wellekens
Publication date
2005/11/27
Conference
IEEE Workshop on Automatic Speech Recognition and Understanding, 2005.
Pages
11-16
Publisher
IEEE
Description
Pole-zero spectral models in the frequency domain have been well studied and understood over the past several decades. Exploiting the duality between the temporal domain and the frequency domain, Kumaresan et al. (R. Kumaresan, et al., March 1999), (R. Kumaresan, October 1998) have shown that the pole-zero model of the analytic speech signal in the temporal domain leads to its characterization in terms of the positive amplitude modulation (AM) and positive instantaneous frequency (PIF). In this paper, we carefully define AM and frequency modulation (FM) signals in the context of ASR. We show that for a theoretically meaningful estimation of the AM signal, it is necessary to decompose the speech signal into several narrow spectral bands as opposed to the previous use of the speech modulation spectrum (V. Tyagi, et al., 2003), (M. Athineos and D. Ellis, 2003), (M. Athineos, et al., April 2004), (Q. Zhu, and A …
Total citations
[Chart: citations per year, 2006 to 2020]