DCASE-2024-Workshop Papers

The 9^th Workshop on Detection and Classification of Acoustic Scenes and Events, DCASE 2024, will take place in Tokyo, Japan on 23-25 October.

Papers

Not yet available

ID	Category	Task	Proposed Model(s)
1	Scenes	Data-Efficient Low-Complexity Acoustic Scene Classification	A CNN-based approach
2	Monitoring	First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring	Autoencoder-based baseline with two operating modes
3	Localization	Audio and audiovisual sound event localization and detection with source distance estimation	Track A: Audio-only baseline Track B: Audiovisual baseline
4	Events	Sound event detection with heterogeneous training dataset and potentially missing labels	Mean-Teacher model
5	Bio	Few-shot Bioacoustic Event Detection	Prototypical Network with negative sampling
6	Caption	Automated Audio Captioning	A sequence-to-sequence system
7	Synthesis	Sound Scene Synthesis	AudioLDM
8	Retrieval	Language-Based Audio Retrieval	A bi-encoder architecture with a pre-trained CNN14 (see PANNs ) being the audio encoder and the Sentence-BERT (i.e., "all-mpnet-base-v2") being the text encoder
9	Separation	Language-Queried Audio Source Separation	LASS-Net
10	Monitoring	Acoustic-based traffic monitoring	A Convolutional Recurrent Neural Network (CRNN) based architecture

ID	Title	Tasks	Year	Repo
1	TAU Urban Acoustic Scenes 2022 Mobile, Development dataset	1	2022	1. TAU Urban Acoustic Scenes 2. Development-train splits
2	Task 2, Development dataset	2	2024
3	Sony-TAu Realistic Spatial Soundscapes 2023 (STARSS23)	3	2023
4	DESED dataset + MAESTRO dataset	4	2024	1. DESED data generation script 2. MAESTRO
5	Task 5, Development dataset	5	2024
6	Clotho v2.1	6	2024
7	Task 7, Development dataset	7	2024
8	Clotho v2.1	8	2024
9	DCASE 2024 LASS Development + Validation(synth)	9	2024	1. Development 2. Validation
10	Task 10, Development dataset	10	2024