Advanced

Doppler-variant modeling of the vocal tract

Heim, Axel; Sorger, Uli and Hug, Florian LU (2008) International Conference on Acoustics, Speech and Signal Processing (ICASSP) In [Host publication title missing] p.4197-4200
Abstract
A common technique to deploy linear prediction to non- stationary signals is time segmentation and local analysis. Variations of a process within such a segment cause inac- curacies. In this paper, we model the temporal changes of linear prediction coefficients (LPCs) as a Fourier series. We obtain a compact description of the vocal tract model limited by the predictor order and the maximum Doppler frequency. Filter stability is guaranteed by all-pass filtering, deploying the human ear’s insensitivity to absolute phase. The period- icity constraint induced by the Fourier series is counteracted by oversampling in the Doppler domain. With this approach, the number of coefficients required for the vocal tract model- ing is significantly... (More)
A common technique to deploy linear prediction to non- stationary signals is time segmentation and local analysis. Variations of a process within such a segment cause inac- curacies. In this paper, we model the temporal changes of linear prediction coefficients (LPCs) as a Fourier series. We obtain a compact description of the vocal tract model limited by the predictor order and the maximum Doppler frequency. Filter stability is guaranteed by all-pass filtering, deploying the human ear’s insensitivity to absolute phase. The period- icity constraint induced by the Fourier series is counteracted by oversampling in the Doppler domain. With this approach, the number of coefficients required for the vocal tract model- ing is significantly reduced compared to a LPC system with block-wise adaptation while exceeding its prediction gain.

As a by-product it is found that the Doppler frequency of the vocal tract is in the order of 10 Hz. A generalization of the algorithm to an auto-regressive moving average model with time-correlated filter coefficients is straight forward. (Less)
Please use this url to cite or link to this publication:
author
publishing date
type
Chapter in Book/Report/Conference proceeding
publication status
published
subject
in
[Host publication title missing]
pages
4197 - 4200
conference name
International Conference on Acoustics, Speech and Signal Processing (ICASSP)
external identifiers
  • Scopus:51449083635
ISSN
1520-6149
ISBN
978-1-4244-1483-3
DOI
10.1109/ICASSP.2008.4518580
language
English
LU publication?
no
id
fe177718-ac52-418c-b5b2-db50b84226a3 (old id 1527911)
date added to LUP
2010-01-13 14:56:15
date last changed
2016-10-13 04:30:39
@misc{fe177718-ac52-418c-b5b2-db50b84226a3,
  abstract     = {A common technique to deploy linear prediction to non- stationary signals is time segmentation and local analysis. Variations of a process within such a segment cause inac- curacies. In this paper, we model the temporal changes of linear prediction coefficients (LPCs) as a Fourier series. We obtain a compact description of the vocal tract model limited by the predictor order and the maximum Doppler frequency. Filter stability is guaranteed by all-pass filtering, deploying the human ear’s insensitivity to absolute phase. The period- icity constraint induced by the Fourier series is counteracted by oversampling in the Doppler domain. With this approach, the number of coefficients required for the vocal tract model- ing is significantly reduced compared to a LPC system with block-wise adaptation while exceeding its prediction gain.<br/><br>
As a by-product it is found that the Doppler frequency of the vocal tract is in the order of 10 Hz. A generalization of the algorithm to an auto-regressive moving average model with time-correlated filter coefficients is straight forward.},
  author       = {Heim, Axel and Sorger, Uli and Hug, Florian},
  isbn         = {978-1-4244-1483-3},
  issn         = {1520-6149},
  language     = {eng},
  pages        = {4197--4200},
  series       = {[Host publication title missing]},
  title        = {Doppler-variant modeling of the vocal tract},
  url          = {http://dx.doi.org/10.1109/ICASSP.2008.4518580},
  year         = {2008},
}