Uncertainty and ignored information in the analysis of bat ultrasound : Bayesian approximation to the rescue
(2022) In Ecological Informatics 70.- Abstract
Bat ultrasound analysis has been around for several decades and it is one of the most important tools in studies of bat ecology. Discrimination between species is based on intra-specific features of echolocation calls. Identification of species and genera in audio files can be attempted either manually or through software which performs a fully automated discrimination between species. However, significant overlap in various features (e.g. frequencies of calls) exists between species and even genera. Species ID is therefore often not an absolute conclusion, but rather an opinion or best guess, as opposed to DNA tests or measurements on external characters of captured bats. To make things even worse, the probability of actually observing... (More)
Bat ultrasound analysis has been around for several decades and it is one of the most important tools in studies of bat ecology. Discrimination between species is based on intra-specific features of echolocation calls. Identification of species and genera in audio files can be attempted either manually or through software which performs a fully automated discrimination between species. However, significant overlap in various features (e.g. frequencies of calls) exists between species and even genera. Species ID is therefore often not an absolute conclusion, but rather an opinion or best guess, as opposed to DNA tests or measurements on external characters of captured bats. To make things even worse, the probability of actually observing a bat of a given species in space and time is ignored when performing bat ultrasound analysis. This study introduces Bayesian approximation through a new method we have named Alternative Bayesian Bat Analysis (ABBA). We show, through a simple proof-of-concept example, the importance of adding information about the local composition of the bat community, hence making informed decisions regarding which species is most likely present in audio files. The superior performance of ABBA is also shown through an example using R code. Here, we use simulated data for three Pipistrellus spp., a genus with significant overlap in frequencies, but the code can easily be adapted to other bat species and genera worldwide. ABBA outperformed the non-Bayesian approach for all three species. The rare species in the simulated data set was super-inflated when using the non-Bayesian method. Further the results show, contrarily to common belief, that the frequency dominated by a given species in a data set, depends on the composition of the bat fauna and not just means and SDs reported in the literature. ABBA allows researchers to include all observations in statistical modeling, rather than excluding observations, an approach which can affect the reliability of studies. This study also, to a great extent, explains the poor performance of software attempting automated bat ID. Implementing Bayesian algorithms, and thereby allowing users to interact with the software, should significantly improve their performance.
(Less)
- author
- Michaelsen, Tore Christian ; Rydell, Jens ; Bååth, Rasmus LU and Jensen, Knut Helge
- organization
- publishing date
- 2022-09
- type
- Contribution to journal
- publication status
- published
- subject
- keywords
- Bat ultrasound analysis, Bayesian statistics, GNU R, Probability theory
- in
- Ecological Informatics
- volume
- 70
- article number
- 101721
- publisher
- Elsevier
- external identifiers
-
- scopus:85132900113
- ISSN
- 1574-9541
- DOI
- 10.1016/j.ecoinf.2022.101721
- language
- English
- LU publication?
- yes
- id
- 52a45eed-c45f-4bdc-a264-a4ecb0a5cae1
- date added to LUP
- 2022-09-05 14:44:28
- date last changed
- 2022-09-05 14:44:28
@article{52a45eed-c45f-4bdc-a264-a4ecb0a5cae1, abstract = {{<p>Bat ultrasound analysis has been around for several decades and it is one of the most important tools in studies of bat ecology. Discrimination between species is based on intra-specific features of echolocation calls. Identification of species and genera in audio files can be attempted either manually or through software which performs a fully automated discrimination between species. However, significant overlap in various features (e.g. frequencies of calls) exists between species and even genera. Species ID is therefore often not an absolute conclusion, but rather an opinion or best guess, as opposed to DNA tests or measurements on external characters of captured bats. To make things even worse, the probability of actually observing a bat of a given species in space and time is ignored when performing bat ultrasound analysis. This study introduces Bayesian approximation through a new method we have named Alternative Bayesian Bat Analysis (ABBA). We show, through a simple proof-of-concept example, the importance of adding information about the local composition of the bat community, hence making informed decisions regarding which species is most likely present in audio files. The superior performance of ABBA is also shown through an example using R code. Here, we use simulated data for three Pipistrellus spp., a genus with significant overlap in frequencies, but the code can easily be adapted to other bat species and genera worldwide. ABBA outperformed the non-Bayesian approach for all three species. The rare species in the simulated data set was super-inflated when using the non-Bayesian method. Further the results show, contrarily to common belief, that the frequency dominated by a given species in a data set, depends on the composition of the bat fauna and not just means and SDs reported in the literature. ABBA allows researchers to include all observations in statistical modeling, rather than excluding observations, an approach which can affect the reliability of studies. This study also, to a great extent, explains the poor performance of software attempting automated bat ID. Implementing Bayesian algorithms, and thereby allowing users to interact with the software, should significantly improve their performance.</p>}}, author = {{Michaelsen, Tore Christian and Rydell, Jens and Bååth, Rasmus and Jensen, Knut Helge}}, issn = {{1574-9541}}, keywords = {{Bat ultrasound analysis; Bayesian statistics; GNU R; Probability theory}}, language = {{eng}}, publisher = {{Elsevier}}, series = {{Ecological Informatics}}, title = {{Uncertainty and ignored information in the analysis of bat ultrasound : Bayesian approximation to the rescue}}, url = {{http://dx.doi.org/10.1016/j.ecoinf.2022.101721}}, doi = {{10.1016/j.ecoinf.2022.101721}}, volume = {{70}}, year = {{2022}}, }