yqcai888 / easy_dcase_task1
This repository provides an easy way to train your models on the datasets of DCASE task 1.
☆14 Updated last month
Alternatives and similar repositories for easy_dcase_task1:
Users who are interested in easy_dcase_task1 are comparing it to the repositories listed below.
- Official data preparation scripts for the URGENT 2024 Challenge ☆76 Updated last month
- ☆31 Updated this week
- wsj0-{2, 3, 4, 5} mix generation scripts, in Python. ☆55 Updated 3 years ago
- This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification" ☆24 Updated last year
- Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022) ☆57 Updated 9 months ago
- PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio ☆165 Updated last year
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain) ☆60 Updated 2 years ago
- Speech Separation ☆61 Updated 11 months ago
- ☆97 Updated last year
- ☆78 Updated 2 years ago
- ☆77 Updated 8 months ago
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024) ☆173 Updated 2 months ago
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION ☆67 Updated 6 months ago
- Official repository of our paper: https://arxiv.org/abs/2010.15366 ☆62 Updated 3 years ago
- ☆49 Updated 2 years ago
- Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation ☆101 Updated 2 years ago
- ☆23 Updated 2 years ago
- Beam-guided TasNet ☆49 Updated 2 years ago
- Baseline method for sound event localization task of DCASE 2022 challenge ☆54 Updated 2 years ago
- Speech Enhancement Metrics (PESQ, CSIG, CBAK, COVL) ☆68 Updated 4 years ago
- ☆90 Updated 3 years ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult… ☆34 Updated 4 months ago
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection". ☆120 Updated 4 months ago
- ☆188 Updated last year
- ☆58 Updated 3 years ago
- The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023 ☆111 Updated last year
- DNN-based SE in the frequency domain using Pytorch. You can test some state-of-the-art networks using T-F masking or spectral mapping met… ☆53 Updated 2 years ago
- A fast implementation of bss_eval metrics for blind source separation ☆134 Updated 2 years ago
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT) ☆40 Updated 2 years ago
- ☆26 Updated 2 years ago