yqcai888 / easy_dcase_task1
This repository provides an easy way to train your models on the datasets of DCASE task 1.
☆12Updated 2 weeks ago
Alternatives and similar repositories for easy_dcase_task1:
Users that are interested in easy_dcase_task1 are comparing it to the libraries listed below
- ☆23Updated this week
- Official data preparation scripts for the URGENT 2024 Challenge☆75Updated last week
- This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"☆23Updated last year
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 2 years ago
- ☆49Updated 2 years ago
- wsj0-{2, 3, 4, 5} mix generation scripts, in Python.☆54Updated 3 years ago
- ☆31Updated 7 months ago
- Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022)☆57Updated 8 months ago
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆37Updated 2 years ago
- AudioLDM training, finetuning, evaluation and inference.☆14Updated 9 months ago
- ☆31Updated 2 months ago
- ☆18Updated 2 years ago
- An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection☆68Updated 3 years ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆32Updated 3 months ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆34Updated 9 months ago
- Baseline method for sound event localization task of DCASE 2022 challenge☆53Updated 2 years ago
- Speech Enhancement Metrics (PESQ, CSIG, CBAK, COVL)☆67Updated 4 years ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆79Updated 4 months ago
- ☆183Updated last year
- A library built for easier audio self-supervised training, downstream tasks evaluation☆111Updated 4 months ago
- PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio☆162Updated last year
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆159Updated last month
- ☆94Updated last year
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆111Updated last month
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆114Updated 3 months ago
- Baseline method for sound event localization task of DCASE 2023 challenge☆44Updated last year
- ☆16Updated 2 months ago
- ☆63Updated 4 months ago
- Data generator for creating synthetic audio mixtures suitable for DCASE Challenge 2022 Task 3☆29Updated last year
- ☆135Updated 11 months ago