
Please use this identifier to cite or link to this item:
http://hdl.handle.net/123456789/5632
Title: | Event Detection in Bioacoustic Datasets using Few Shot Learning |
Authors: | Das, Ritam |
Keywords: | Sound Event Detection Mel Scale Clap Sound Precision and Recall |
Issue Date: | May-2024 |
Publisher: | IISER Mohali |
Abstract: | Bioacoustic research is pivotal in understanding biodiversity, ecosystem health, and animal communication patterns. Despite its significance, acquiring high-quality bioacous- tic datasets remains challenging due to environmental variability and limited field access. Recorded datasets are often very long, but the signals of interest are short and rare, making manual annotation a time- and labor-intensive task. In recent years, deep learning techniques have shown promise in bioacoustic event de- tection tasks, offering data-efficient methods for analyzing large and sparse volumes of audio recordings. This dissertation investigates applying a deep learning method called Prototypical Networks to a large dataset of meerkat (Suricata suricatta) vocalizations. Pro- totypical Networks have gained attention for their ability to generalize well to unseen sam- ples when exposed to only a few labeled ones. The meerkat dataset is an ideal testbed for bioacoustic event detection as it is sparse and covered with a challenging amount of various noise, but comes with label information for almost 200h of audio. The proposed model builds on the baseline approach provided as part of the DCASE 2023 bioacoustic event detection challenge, a yearly challenge of the bioacoustic research community. Our findings highlight the potential and limitations of Prototypical Networks with data obtained under extreme conditions. This dissertation discusses use cases, best practices, and potential future research paths for designing a robust few-shot event detection model that is able to capture essential features in sparse and noisy data across a spectrum of bioacoustic datasets. Keywords: Bioacoustic Research, Event Detection, Few-shot Learning Prototypical Networks, Meerkat Vocalizations |
URI: | http://hdl.handle.net/123456789/5632 |
Appears in Collections: | MS-19 |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Yet to obtain consent.pdf | 144.56 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.