vak: a neural network framework for researchers studying animal acoustic communication

David Nicholson, Yarden Cohen

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

How is speech like birdsong? What do we mean when we say an animal learns their vocalizations? Questions like these are answered by studying how animals communicate with sound. As in many other fields, the study of acoustic communication is being revolutionized by deep neural network models. These models make it possible to answer questions that were previously impossible to address, in part because the models automate analysis of very large datasets. Acoustic communication researchers have developed multiple models for similar tasks, often implemented as research code with one of several libraries, such as Keras and PyTorch. This situation has created a real need for a framework that allows researchers to easily benchmark multiple models, and test new models, with their own data. To address this need, we developed vak (https://github.com/vocalpy/vak), a neural network framework designed for acoustic communication researchers. ("vak" is pronounced like "talk" or "squawk" and was chosen for its similarity to the Latin root voc, as in "vocal".) Here we describe the design of vak and explain how the framework makes it easy for researchers to apply neural network models to their own data. We highlight enhancements made in version 1.0 that significantly improve user experience with the library. To provide researchers without expertise in deep learning access to these models, vak can be run via a command-line interface that uses configuration files. Vak can also be used directly in scripts by scientist-coders. To achieve this, vak adapts design patterns and an API from other domain-specific PyTorch libraries such as torchvision …
Original language: English
Title of host publication: Proceedings of the 22nd Python in Science Conference (SciPy 2023)
Editors: Meghann Agarwal, Chris Calloway, Dillon Niederhut
Pages: 59-67
Number of pages: 9
State: Published - Jul 2023
Event: Python in Science Conference - Austin, Texas, United States
Duration: 10 Jul 2023 to 16 Jul 2023
Conference number: 22
https://conference.scipy.org/proceedings/scipy2023/

Conference

Conference: Python in Science Conference
Abbreviated title: SciPy 2023
Country/Territory: United States
City: Austin
Period: 10/07/23 to 16/07/23
