Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

F0 and Segment Duration in Formant Synthesis of Speaker Age

Schötz, Susanne LU (2006) Speech Prosody 2006 p.515-518
Abstract
This paper describes the work with F0 and segment duration when developing a prototype system for analysis of speaker age using data-driven formant synthesis. The system was developed to extract 23 parameters from the test words—spoken by four differently aged female speakers of the same dialect and family — and to generate synthetic copies. Audio-visual feedback enabled the user to compare the natural and synthetic versions and facilitated parameter adjustment. Next, weighted linear interpolation was used in a first crude attempt to synthesize speaker age. Evaluation of the system revealed its strengths and weaknesses, and suggested further improvements. F0 and duration performed better than most other parameters.
Please use this url to cite or link to this publication:
author
organization
publishing date
type
Chapter in Book/Report/Conference proceeding
publication status
published
subject
host publication
Proc. of Speech Prosody
pages
515 - 518
publisher
Dresden
conference name
Speech Prosody 2006
conference location
Dresden, Germany
conference dates
2006-05-02 - 2006-05-05
external identifiers
  • scopus:85045758972
language
English
LU publication?
yes
additional info
The information about affiliations in this record was updated in December 2015. The record was previously connected to the following departments: Linguistics and Phonetics (015010003)
id
e2afea8b-91fb-4440-aa63-ae44dfaa1018 (old id 1471511)
date added to LUP
2016-04-04 11:03:06
date last changed
2020-10-04 06:41:42
@inproceedings{e2afea8b-91fb-4440-aa63-ae44dfaa1018,
  abstract     = {This paper describes the work with F0 and segment duration when developing a prototype system for analysis of speaker age using data-driven formant synthesis. The system was developed to extract 23 parameters from the test words—spoken by four differently aged female speakers of the same dialect and family — and to generate synthetic copies. Audio-visual feedback enabled the user to compare the natural and synthetic versions and facilitated parameter adjustment. Next, weighted linear interpolation was used in a first crude attempt to synthesize speaker age. Evaluation of the system revealed its strengths and weaknesses, and suggested further improvements. F0 and duration performed better than most other parameters.},
  author       = {Schötz, Susanne},
  booktitle    = {Proc. of Speech Prosody},
  language     = {eng},
  pages        = {515--518},
  publisher    = {Dresden},
  title        = {F0 and Segment Duration in Formant Synthesis of Speaker Age},
  url          = {https://lup.lub.lu.se/search/ws/files/5683046/1471512.pdf},
  year         = {2006},
}