Classification Threshold and Training Data Affect the Quality and Utility of Focal Species Data Processed with Automated Audio-Recognition Software

Automated recognition is increasingly used to extract information about species vocalizations from audio recordings. During processing, recognizers calculate the probability of correct classification (“score”) for each vocalization. Our goal was to investigate the implications of recognizer score for ecological research and monitoring. We found a strong relationship between score and the distance at which a call was recorded if the recognizer was trained with calls recorded at close range. These results show that score threshold choice is a decision about sampling area, not just about the balance between false negative and false positive results. Overall, we showed that training recognizers with ‘high-quality’ clips that were recorded at close range will improve the utility of the data without affecting how many true positives the recognizer detects.

Data and Resources

Additional Info

Field Value
Short Name of Publication Classification Threshold and Training Data Affect the Quality and Utility of Focal Species Data Processed with Automated Audio-Recognition Software
Deliverable Type Peer-reviewed Publication
Program Catagory OSM
Program Type OSM
Author Elly C. Knight & Erin M. Bayne
Periodical Title Taylor & Francis Online
Year of Publication 2019
Publishing Organization ABMI
Month of Publication 05
Periodical Volumes 03 Aug 2018
Page Range 539-554
Digital Object Identifier (DOI) 10.1080/09524622.2018.1503971
Online ISBN/ISSN
Print ISBN/ISSN
Recomended Citation Elly C. Knight & Erin M. Bayne (2019) Classification threshold and training data affect the quality and utility of focal species data processed with automated audio-recognition software, Bioacoustics, 28:6, 539-554, DOI: 10.1080/09524622.2018.1503971
Maintainer