Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Doppler-variant modeling of the vocal tract

Heim, Axel ; Sorger, Uli and Hug, Florian LU (2008) International Conference on Acoustics, Speech and Signal Processing (ICASSP) p.4197-4200
Abstract
A common technique to deploy linear prediction to non- stationary signals is time segmentation and local analysis. Variations of a process within such a segment cause inac- curacies. In this paper, we model the temporal changes of linear prediction coefficients (LPCs) as a Fourier series. We obtain a compact description of the vocal tract model limited by the predictor order and the maximum Doppler frequency. Filter stability is guaranteed by all-pass filtering, deploying the human ear’s insensitivity to absolute phase. The period- icity constraint induced by the Fourier series is counteracted by oversampling in the Doppler domain. With this approach, the number of coefficients required for the vocal tract model- ing is significantly... (More)
A common technique to deploy linear prediction to non- stationary signals is time segmentation and local analysis. Variations of a process within such a segment cause inac- curacies. In this paper, we model the temporal changes of linear prediction coefficients (LPCs) as a Fourier series. We obtain a compact description of the vocal tract model limited by the predictor order and the maximum Doppler frequency. Filter stability is guaranteed by all-pass filtering, deploying the human ear’s insensitivity to absolute phase. The period- icity constraint induced by the Fourier series is counteracted by oversampling in the Doppler domain. With this approach, the number of coefficients required for the vocal tract model- ing is significantly reduced compared to a LPC system with block-wise adaptation while exceeding its prediction gain.

As a by-product it is found that the Doppler frequency of the vocal tract is in the order of 10 Hz. A generalization of the algorithm to an auto-regressive moving average model with time-correlated filter coefficients is straight forward. (Less)
Please use this url to cite or link to this publication:
author
; and
publishing date
type
Chapter in Book/Report/Conference proceeding
publication status
published
subject
host publication
ITG Conference on Voice Communication [8. ITG-Fachtagung]
pages
4197 - 4200
conference name
International Conference on Acoustics, Speech and Signal Processing (ICASSP)
conference dates
2008-03-31 - 2008-04-04
external identifiers
  • scopus:51449083635
ISSN
1520-6149
ISBN
978-1-4244-1483-3
DOI
10.1109/ICASSP.2008.4518580
language
English
LU publication?
no
id
fe177718-ac52-418c-b5b2-db50b84226a3 (old id 1527911)
date added to LUP
2016-04-04 09:26:40
date last changed
2022-01-29 17:54:40
@inproceedings{fe177718-ac52-418c-b5b2-db50b84226a3,
  abstract     = {{A common technique to deploy linear prediction to non- stationary signals is time segmentation and local analysis. Variations of a process within such a segment cause inac- curacies. In this paper, we model the temporal changes of linear prediction coefficients (LPCs) as a Fourier series. We obtain a compact description of the vocal tract model limited by the predictor order and the maximum Doppler frequency. Filter stability is guaranteed by all-pass filtering, deploying the human ear’s insensitivity to absolute phase. The period- icity constraint induced by the Fourier series is counteracted by oversampling in the Doppler domain. With this approach, the number of coefficients required for the vocal tract model- ing is significantly reduced compared to a LPC system with block-wise adaptation while exceeding its prediction gain.<br/><br>
As a by-product it is found that the Doppler frequency of the vocal tract is in the order of 10 Hz. A generalization of the algorithm to an auto-regressive moving average model with time-correlated filter coefficients is straight forward.}},
  author       = {{Heim, Axel and Sorger, Uli and Hug, Florian}},
  booktitle    = {{ITG Conference on Voice Communication [8. ITG-Fachtagung]}},
  isbn         = {{978-1-4244-1483-3}},
  issn         = {{1520-6149}},
  language     = {{eng}},
  pages        = {{4197--4200}},
  title        = {{Doppler-variant modeling of the vocal tract}},
  url          = {{https://lup.lub.lu.se/search/files/5326068/1593177.pdf}},
  doi          = {{10.1109/ICASSP.2008.4518580}},
  year         = {{2008}},
}