Sound event detection with heterogeneous training dataset and potentially missing labels
Mean-Teacher model
| 5 | Bio | Few-shot Bioacoustic Event Detection | Prototypical Network with negative sampling |
| 6 | Caption | Automated Audio Captioning | A sequence-to-sequence system |
| 7 | Synthesis | Sound Scene Synthesis | AudioLDM |
| 8 | Retrieval | Language-Based Audio Retrieval | A bi-encoder architecture with a pre-trained CNN14 (see PANNs) as the audio encoder and Sentence-BERT (i.e., "all-mpnet-base-v2") as the text encoder |
| 9 | Separation | Language-Queried Audio Source Separation | LASS-Net |
| 10 | Monitoring | Acoustic-based traffic monitoring | A Convolutional Recurrent Neural Network (CRNN) based architecture |
Datasets
| ID | Title | Tasks | Year | Repo |
1
TAU Urban Acoustic Scenes 2022 Mobile, Development dataset