Lund University Publications

Production Strategies of Vocal Attitudes

Salais, Léane; Arias, Pablo (LU); Le Moine, Clément; Rosi, Victor; Teytaut, Yann; Obin, Nicolas and Roebel, Axel (2022). 23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2022-September, p. 4985-4989.
Abstract

Humans have an impressive ability to communicate precise social intentions and desires with their voice, through vocal attitudes. Previous studies have shown how isolated acoustic features such as pitch can convey social attitudes, but have mostly worked with single attitudes and have not controlled for inter-speaker variability. Thus, the vocal behaviours used to produce social attitudes remain mostly unknown. The aim of the current study is to uncover the anatomical production strategies that speakers use to communicate vocal attitudes. To do this, we analysed recordings from N=20 French speakers producing dominant, friendly, seductive and distant speech. For each of these attitudes, we investigated speakers' vocal fold behaviour, vocal tract actuation and phonetic speech structure, with the support of deep alignment methods, and compared them using group statistics. We notably produced high-level representations of speakers' articulation (e.g. Vowel Space Density) and speech rhythm. Our results reveal speakers' prototypical strategies to produce vocal attitudes, and highlight how vocal behaviours can communicate social signals. We expect these results to provide an objective validation method for deep voice attitude conversions.

author
Salais, Léane; Arias, Pablo; Le Moine, Clément; Rosi, Victor; Teytaut, Yann; Obin, Nicolas and Roebel, Axel
organization
publishing date
2022
type
Chapter in Book/Report/Conference proceeding
publication status
published
subject
keywords
articulation, speech production, vocal social attitudes
host publication
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
series title
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
volume
2022-September
pages
5 pages
conference name
23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022
conference location
Incheon, Korea, Republic of
conference dates
2022-09-18 - 2022-09-22
external identifiers
  • scopus:85140054205
ISSN
2308-457X
DOI
10.21437/Interspeech.2022-10947
language
English
LU publication?
yes
id
c357e511-a327-4f8e-96d2-ea29deafd4ed
date added to LUP
2022-12-19 15:12:36
date last changed
2023-09-11 11:31:05
@inproceedings{c357e511-a327-4f8e-96d2-ea29deafd4ed,
  abstract     = {{Humans have an impressive ability to communicate precise social intentions and desires with their voice, through vocal attitudes. Previous studies have shown how isolated acoustic features such as pitch can convey social attitudes, but have mostly worked with single attitudes and have not controlled for inter-speaker variability. Thus, the vocal behaviours used to produce social attitudes remain mostly unknown. The aim of the current study is to uncover the anatomical production strategies that speakers use to communicate vocal attitudes. To do this, we analysed recordings from N=20 French speakers producing dominant, friendly, seductive and distant speech. For each of these attitudes, we investigated speakers' vocal fold behaviour, vocal tract actuation and phonetic speech structure, with the support of deep alignment methods, and compared them using group statistics. We notably produced high-level representations of speakers' articulation (e.g. Vowel Space Density) and speech rhythm. Our results reveal speakers' prototypical strategies to produce vocal attitudes, and highlight how vocal behaviours can communicate social signals. We expect these results to provide an objective validation method for deep voice attitude conversions.}},
  author       = {{Salais, Léane and Arias, Pablo and Le Moine, Clément and Rosi, Victor and Teytaut, Yann and Obin, Nicolas and Roebel, Axel}},
  booktitle    = {{Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH}},
  issn         = {{2308-457X}},
  keywords     = {{articulation; speech production; vocal social attitudes}},
  language     = {{eng}},
  pages        = {{4985--4989}},
  series       = {{Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH}},
  title        = {{Production Strategies of Vocal Attitudes}},
  url          = {{http://dx.doi.org/10.21437/Interspeech.2022-10947}},
  doi          = {{10.21437/Interspeech.2022-10947}},
  volume       = {{2022-September}},
  year         = {{2022}},
}