Event Detection in Bioacoustic Datasets using Few Shot Learning

dc.contributor.authorDas, Ritam
dc.date.accessioned2025-02-17T06:23:19Z
dc.date.available2025-02-17T06:23:19Z
dc.date.issued2024-05
dc.description.abstractBioacoustic research is pivotal in understanding biodiversity, ecosystem health, and animal communication patterns. Despite its significance, acquiring high-quality bioacous- tic datasets remains challenging due to environmental variability and limited field access. Recorded datasets are often very long, but the signals of interest are short and rare, making manual annotation a time- and labor-intensive task. In recent years, deep learning techniques have shown promise in bioacoustic event de- tection tasks, offering data-efficient methods for analyzing large and sparse volumes of audio recordings. This dissertation investigates applying a deep learning method called Prototypical Networks to a large dataset of meerkat (Suricata suricatta) vocalizations. Pro- totypical Networks have gained attention for their ability to generalize well to unseen sam- ples when exposed to only a few labeled ones. The meerkat dataset is an ideal testbed for bioacoustic event detection as it is sparse and covered with a challenging amount of various noise, but comes with label information for almost 200h of audio. The proposed model builds on the baseline approach provided as part of the DCASE 2023 bioacoustic event detection challenge, a yearly challenge of the bioacoustic research community. Our findings highlight the potential and limitations of Prototypical Networks with data obtained under extreme conditions. This dissertation discusses use cases, best practices, and potential future research paths for designing a robust few-shot event detection model that is able to capture essential features in sparse and noisy data across a spectrum of bioacoustic datasets. Keywords: Bioacoustic Research, Event Detection, Few-shot Learning Prototypical Networks, Meerkat Vocalizationsen_US
dc.guidePrasad, N Gen_US
dc.identifier.urihttp://hdl.handle.net/123456789/5632
dc.language.isoenen_US
dc.publisherIISER Mohalien_US
dc.subjectSound Event Detectionen_US
dc.subjectMel Scaleen_US
dc.subjectClap Sounden_US
dc.subjectPrecision and Recallen_US
dc.titleEvent Detection in Bioacoustic Datasets using Few Shot Learningen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Yet to obtain consent.pdf
Size:
144.56 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections