F0 and Segment Duration in Formant Synthesis of Speaker Age
(2006) Speech Prosody 2006 p.515-518
- Abstract
- This paper describes the work with F0 and segment duration when developing a prototype system for analysis of speaker age using data-driven formant synthesis. The system was developed to extract 23 parameters from the test words, spoken by four differently aged female speakers of the same dialect and family, and to generate synthetic copies. Audio-visual feedback enabled the user to compare the natural and synthetic versions and facilitated parameter adjustment. Next, weighted linear interpolation was used in a first crude attempt to synthesize speaker age. Evaluation of the system revealed its strengths and weaknesses, and suggested further improvements. F0 and duration performed better than most other parameters.
Please use this URL to cite or link to this publication:
https://lup.lub.lu.se/record/1471511
- author
- Schötz, Susanne LU
- organization
- publishing date
- 2006
- type
- Chapter in Book/Report/Conference proceeding
- publication status
- published
- subject
- host publication
- Proc. of Speech Prosody
- pages
- 515 - 518
- publisher
- Dresden
- conference name
- Speech Prosody 2006
- conference location
- Dresden, Germany
- conference dates
- 2006-05-02 - 2006-05-05
- external identifiers
- scopus:85045758972
- language
- English
- LU publication?
- yes
- additional info
- The information about affiliations in this record was updated in December 2015. The record was previously connected to the following departments: Linguistics and Phonetics (015010003)
- id
- e2afea8b-91fb-4440-aa63-ae44dfaa1018 (old id 1471511)
- date added to LUP
- 2016-04-04 11:03:06
- date last changed
- 2023-09-20 02:31:33
@inproceedings{e2afea8b-91fb-4440-aa63-ae44dfaa1018,
  abstract     = {{This paper describes the work with F0 and segment duration when developing a prototype system for analysis of speaker age using data-driven formant synthesis. The system was developed to extract 23 parameters from the test words, spoken by four differently aged female speakers of the same dialect and family, and to generate synthetic copies. Audio-visual feedback enabled the user to compare the natural and synthetic versions and facilitated parameter adjustment. Next, weighted linear interpolation was used in a first crude attempt to synthesize speaker age. Evaluation of the system revealed its strengths and weaknesses, and suggested further improvements. F0 and duration performed better than most other parameters.}},
  author       = {{Schötz, Susanne}},
  booktitle    = {{Proc. of Speech Prosody}},
  language     = {{eng}},
  pages        = {{515--518}},
  publisher    = {{Dresden}},
  title        = {{F0 and Segment Duration in Formant Synthesis of Speaker Age}},
  url          = {{https://lup.lub.lu.se/search/files/5683046/1471512.pdf}},
  year         = {{2006}},
}