Production Strategies of Vocal Attitudes
(2022) 23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022 In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2022-September. p.4985-4989- Abstract
Humans have an impressive ability to communicate precise social intentions and desires with their voice - through vocal attitudes. Previous studies have shown how isolated acoustic features such as pitch can convey social attitudes, but have mostly worked with single attitudes and have not controlled for inter-speaker variability. Thus, the vocal behaviours used to produce social attitudes remain mostly unknown. That is the aim of the current study, to uncover the anatomic production strategies that speakers use to communicate vocal attitudes. To do this, we analysed recordings from N=20 French speakers producing dominant, friendly, seductive and distant speech. For each of these attitudes, we investigated their vocal fold behaviour,... (More)
Humans have an impressive ability to communicate precise social intentions and desires with their voice - through vocal attitudes. Previous studies have shown how isolated acoustic features such as pitch can convey social attitudes, but have mostly worked with single attitudes and have not controlled for inter-speaker variability. Thus, the vocal behaviours used to produce social attitudes remain mostly unknown. That is the aim of the current study, to uncover the anatomic production strategies that speakers use to communicate vocal attitudes. To do this, we analysed recordings from N=20 French speakers producing dominant, friendly, seductive and distant speech. For each of these attitudes, we investigated their vocal fold behaviour, vocal tract actuation and phonetic speech structure, with the support of deep alignment methods, and compared them with group statistics. We notably produced high-level representations of speakers' articulation (e.g. Vowel Space Density) and speech rhythm. Our results reveal speakers' prototypical strategies to produce vocal attitudes, and highlight how vocal behaviours can communicate social signals. We expect these results to provide an objective validation method for deep voice attitude conversions.
(Less)
- author
- Salais, Léane ; Arias, Pablo LU ; Le Moine, Clément ; Rosi, Victor ; Teytaut, Yann ; Obin, Nicolas and Roebel, Axel
- organization
- publishing date
- 2022
- type
- Chapter in Book/Report/Conference proceeding
- publication status
- published
- subject
- keywords
- articulation, speech production, vocal social attitudes
- host publication
- Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
- series title
- Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
- volume
- 2022-September
- pages
- 5 pages
- conference name
- 23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022
- conference location
- Incheon, Korea, Republic of
- conference dates
- 2022-09-18 - 2022-09-22
- external identifiers
-
- scopus:85140054205
- ISSN
- 2308-457X
- DOI
- 10.21437/Interspeech.2022-10947
- language
- English
- LU publication?
- yes
- id
- c357e511-a327-4f8e-96d2-ea29deafd4ed
- date added to LUP
- 2022-12-19 15:12:36
- date last changed
- 2023-09-11 11:31:05
@inproceedings{c357e511-a327-4f8e-96d2-ea29deafd4ed, abstract = {{<p>Humans have an impressive ability to communicate precise social intentions and desires with their voice - through vocal attitudes. Previous studies have shown how isolated acoustic features such as pitch can convey social attitudes, but have mostly worked with single attitudes and have not controlled for inter-speaker variability. Thus, the vocal behaviours used to produce social attitudes remain mostly unknown. That is the aim of the current study, to uncover the anatomic production strategies that speakers use to communicate vocal attitudes. To do this, we analysed recordings from N=20 French speakers producing dominant, friendly, seductive and distant speech. For each of these attitudes, we investigated their vocal fold behaviour, vocal tract actuation and phonetic speech structure, with the support of deep alignment methods, and compared them with group statistics. We notably produced high-level representations of speakers' articulation (e.g. Vowel Space Density) and speech rhythm. Our results reveal speakers' prototypical strategies to produce vocal attitudes, and highlight how vocal behaviours can communicate social signals. We expect these results to provide an objective validation method for deep voice attitude conversions.</p>}}, author = {{Salais, Léane and Arias, Pablo and Le Moine, Clément and Rosi, Victor and Teytaut, Yann and Obin, Nicolas and Roebel, Axel}}, booktitle = {{Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH}}, issn = {{2308-457X}}, keywords = {{articulation; speech production; vocal social attitudes}}, language = {{eng}}, pages = {{4985--4989}}, series = {{Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH}}, title = {{Production Strategies of Vocal Attitudes}}, url = {{http://dx.doi.org/10.21437/Interspeech.2022-10947}}, doi = {{10.21437/Interspeech.2022-10947}}, volume = {{2022-September}}, year = {{2022}}, }