Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Studying vocal annoyance with self-steered sound synthesis

Anikin, Andrey LU orcid and Reby, David (2026) In Journal of Experimental Psychology: General
Abstract
The ability to hold listeners’ attention and prevent habituation is a crucial design feature for both biological and human-made distress and alarm signals, but their intrusive nature also causes stress and annoyance. To investigate what makes vocalizations distracting and annoying, we developed Self-Steered Sound Synthesis—a new experimental paradigm for studying voice perception, in which untrained participants created fully synthetic yet realistic vocalizations by adjusting voice pitch, aspects of voice quality, and temporal structure of short vocalization sequences (dog barks, seagull calls, human shouts, etc.). Independent samples of listeners then evaluated the results. The primary determinants of vocal annoyance were as follows, from... (More)
The ability to hold listeners’ attention and prevent habituation is a crucial design feature for both biological and human-made distress and alarm signals, but their intrusive nature also causes stress and annoyance. To investigate what makes vocalizations distracting and annoying, we developed Self-Steered Sound Synthesis—a new experimental paradigm for studying voice perception, in which untrained participants created fully synthetic yet realistic vocalizations by adjusting voice pitch, aspects of voice quality, and temporal structure of short vocalization sequences (dog barks, seagull calls, human shouts, etc.). Independent samples of listeners then evaluated the results. The primary determinants of vocal annoyance were as follows, from most to least important: (a) the amount of signal per unit of time, including the proportion of time spent vocalizing within a sequence and the proportion of unpleasant-sounding irregular phonation (nonlinear vocal phenomena); (b) the rate of acoustic events such as individual calls or sudden changes in voice quality; and (c) the unpredictable temporal structure of vocalization sequences, which was consistently increased in Self-Steered Sound Synthesis experiments but did not affect annoyance ratings. Thus, sequences of short vocal signals produced at a fast and/or unpredictable rate may be functionally optimal for alerting without undue annoyance, whereas continuous and harsh signals like baby cries cause maximum acoustic stress, making them impossible to ignore. With the Self-Steered Sound Synthesis method, complementary voice production and perception paradigms can be extended far beyond the range of vocal stimuli that participants can produce themselves, with wide-ranging applications in research on human and animal signals and sound design. (PsycInfo Database Record (c) 2026 APA, all rights reserved) (Less)
Please use this url to cite or link to this publication:
author
and
organization
publishing date
type
Contribution to journal
publication status
epub
subject
keywords
vocal communication, distraction, annoyance, attention, nonlinear vocal phenomena
in
Journal of Experimental Psychology: General
publisher
American Psychological Association (APA)
external identifiers
  • pmid:41729734
ISSN
0096-3445
DOI
10.1037/xge0001906
language
English
LU publication?
yes
id
182db963-b426-4e20-a74f-6398b7da705a
date added to LUP
2026-02-28 01:16:52
date last changed
2026-03-20 14:16:47
@article{182db963-b426-4e20-a74f-6398b7da705a,
  abstract     = {{The ability to hold listeners’ attention and prevent habituation is a crucial design feature for both biological and human-made distress and alarm signals, but their intrusive nature also causes stress and annoyance. To investigate what makes vocalizations distracting and annoying, we developed Self-Steered Sound Synthesis—a new experimental paradigm for studying voice perception, in which untrained participants created fully synthetic yet realistic vocalizations by adjusting voice pitch, aspects of voice quality, and temporal structure of short vocalization sequences (dog barks, seagull calls, human shouts, etc.). Independent samples of listeners then evaluated the results. The primary determinants of vocal annoyance were as follows, from most to least important: (a) the amount of signal per unit of time, including the proportion of time spent vocalizing within a sequence and the proportion of unpleasant-sounding irregular phonation (nonlinear vocal phenomena); (b) the rate of acoustic events such as individual calls or sudden changes in voice quality; and (c) the unpredictable temporal structure of vocalization sequences, which was consistently increased in Self-Steered Sound Synthesis experiments but did not affect annoyance ratings. Thus, sequences of short vocal signals produced at a fast and/or unpredictable rate may be functionally optimal for alerting without undue annoyance, whereas continuous and harsh signals like baby cries cause maximum acoustic stress, making them impossible to ignore. With the Self-Steered Sound Synthesis method, complementary voice production and perception paradigms can be extended far beyond the range of vocal stimuli that participants can produce themselves, with wide-ranging applications in research on human and animal signals and sound design. (PsycInfo Database Record (c) 2026 APA, all rights reserved)}},
  author       = {{Anikin, Andrey and Reby, David}},
  issn         = {{0096-3445}},
  keywords     = {{vocal communication; distraction; annoyance; attention; nonlinear vocal phenomena}},
  language     = {{eng}},
  publisher    = {{American Psychological Association (APA)}},
  series       = {{Journal of Experimental Psychology: General}},
  title        = {{Studying vocal annoyance with self-steered sound synthesis}},
  url          = {{http://dx.doi.org/10.1037/xge0001906}},
  doi          = {{10.1037/xge0001906}},
  year         = {{2026}},
}