F0 and Segment Duration in Formant Synthesis of Speaker Age

Schötz, Susanne

F0 and Segment Duration in Formant Synthesis of Speaker Age

Mark

Schötz, Susanne ^LU

(2006) Speech Prosody 2006 p.515-518

Abstract: This paper describes the work with F0 and segment duration when developing a prototype system for analysis of speaker age using data-driven formant synthesis. The system was developed to extract 23 parameters from the test words—spoken by four differently aged female speakers of the same dialect and family — and to generate synthetic copies. Audio-visual feedback enabled the user to compare the natural and synthetic versions and facilitated parameter adjustment. Next, weighted linear interpolation was used in a ﬁrst crude attempt to synthesize speaker age. Evaluation of the system revealed its strengths and weaknesses, and suggested further improvements. F0 and duration performed better than most other parameters.

Please use this url to cite or link to this publication: https://lup.lub.lu.se/record/1471511

author

Schötz, Susanne ^LU

organization

Phonetics

publishing date

2006

type

Chapter in Book/Report/Conference proceeding

publication status

published

subject

Comparative Language Studies and Linguistics

host publication

Proc. of Speech Prosody

pages

515 - 518

publisher

Dresden

conference name

Speech Prosody 2006

conference location

Dresden, Germany

conference dates

2006-05-02 - 2006-05-05

external identifiers

scopus:85045758972

language

English

LU publication?

yes

additional info

The information about affiliations in this record was updated in December 2015. The record was previously connected to the following departments: Linguistics and Phonetics (015010003)

id

e2afea8b-91fb-4440-aa63-ae44dfaa1018 (old id 1471511)

date added to LUP

2016-04-04 11:03:06

date last changed

2025-10-14 09:42:48

@inproceedings{e2afea8b-91fb-4440-aa63-ae44dfaa1018,
  abstract     = {{This paper describes the work with F0 and segment duration when developing a prototype system for analysis of speaker age using data-driven formant synthesis. The system was developed to extract 23 parameters from the test words—spoken by four differently aged female speakers of the same dialect and family — and to generate synthetic copies. Audio-visual feedback enabled the user to compare the natural and synthetic versions and facilitated parameter adjustment. Next, weighted linear interpolation was used in a ﬁrst crude attempt to synthesize speaker age. Evaluation of the system revealed its strengths and weaknesses, and suggested further improvements. F0 and duration performed better than most other parameters.}},
  author       = {{Schötz, Susanne}},
  booktitle    = {{Proc. of Speech Prosody}},
  language     = {{eng}},
  pages        = {{515--518}},
  publisher    = {{Dresden}},
  title        = {{F0 and Segment Duration in Formant Synthesis of Speaker Age}},
  url          = {{https://lup.lub.lu.se/search/files/5683046/1471512.pdf}},
  year         = {{2006}},
}

Lund University Publications

LUND UNIVERSITY LIBRARIES

F0 and Segment Duration in Formant Synthesis of Speaker Age