Abstract
Animal vocalizations are ubiquitous produced by various taxa and represented in all habitats. Tracking and quantifying animal vocalizations is a basic necessity in various biological disciplines such as nature conservation and biomonitoring. With the advancement of digital recording technology, a huge amount of audio recordings is accumulated. Since manual annotation and an analysis of relevant acoustic features is impractical, development of reliable algorithms for an automatic analysis of birdsong is highly required. One of the first challenges in a birdsong analysis is that of segmentation of the acoustic signal, i.e. detection and demarcation of its basic elements or syllables prior to a further analysis. In this study, we present two simple unsupervised algorithms for automatic birdsong segmentation and parameter estimation. The algorithms are based on a smoothed envelope of the short-time energy of the signal, parameters derived from the fundamental frequency and short-time Fourier transform (STFT). The methods were evaluated using a small database of trill vocalizations recorded with high background noise. The algorithms output was compared to manual segmentation carried out by a human expert and to ground truth values obtained by using synthetic signals after which it was realized that they produced highly similar results. In summary, the methods are shown to accurately segment birdsong signals with high background noise levels. Since they are simple to implement, they could be of great benefit to bioacoustics researchers.
Original language | English |
---|---|
Title of host publication | Proceedings of IWSSIP 2019 - 2019 International Conference on Systems, Signals and Image Processing |
Editors | Snjezana Rimac-Drlje, Drago Zagar, Irena Galic, Goran Martinovic, Denis Vranjes, Marija Habijan |
Publisher | IEEE Computer Society |
Pages | 211-216 |
Number of pages | 6 |
ISBN (Electronic) | 9781728132273 |
DOIs | |
State | Published - Jun 2019 |
Event | 26th International Conference on Systems, Signals and Image Processing, IWSSIP 2019 - Osijek, Croatia Duration: 5 Jun 2019 → 7 Jun 2019 |
Publication series
Name | International Conference on Systems, Signals, and Image Processing |
---|---|
Volume | 2019-June |
Conference
Conference | 26th International Conference on Systems, Signals and Image Processing, IWSSIP 2019 |
---|---|
Country/Territory | Croatia |
City | Osijek |
Period | 5/06/19 → 7/06/19 |
Keywords
- audio segmentation
- audio signal processing
- bioacoustics
- bird vocalization
- birdsong analysis
All Science Journal Classification (ASJC) codes
- Software
- Artificial Intelligence
- Signal Processing
- Computer Networks and Communications