denfed / wave-spec-fusion
Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal Feature Fusion"
☆15 Updated 4 years ago
Alternatives and similar repositories for wave-spec-fusion
Users who are interested in wave-spec-fusion are comparing it to the repositories listed below.
- A Diffusion Probabilistic Model for Target Sound Extraction ☆40 Updated 10 months ago
- Boosting Self-Supervised Embeddings for Speech Enhancement ☆47 Updated 3 years ago
- ☆83 Updated 2 months ago
- Learning differentiable temporal resolution on time-series data. ☆36 Updated 2 years ago
- Paderborn Sound Event Detection ☆74 Updated 2 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection ☆28 Updated 3 years ago
- ☆58 Updated last year
- ☆55 Updated 2 years ago
- PyTorch implementation of LiMuSE ☆31 Updated 2 years ago
- Unofficial PyTorch Lightning implementation of "A New Framework for CNN-Based Speech Enhancement in the Time Domain" ☆20 Updated 2 years ago
- Data generator for creating synthetic audio mixtures suitable for DCASE Challenge 2022 Task 3 ☆39 Updated 2 years ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge. ☆62 Updated 2 months ago
- ☆37 Updated 3 years ago
- multi-channel target speech extraction with channel decorrelation and target speaker adaptation ☆25 Updated 4 years ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge ☆12 Updated last year
- PyTorch implementation of the LEAF audio frontend ☆73 Updated 2 years ago
- ICASSP 2025: Dynamic Embedding Causal Target Speech Extraction ☆3 Updated 4 months ago
- The source code of Tim-TSENet ☆14 Updated 3 years ago
- ☆87 Updated last year
- Official implementation of EfficientLEAF, a learnable audio frontend. ☆47 Updated 2 years ago
- Multi-Phase Gammatone Filterbank (MP-GTF) construction for Python ☆47 Updated 5 years ago
- ☆37 Updated 2 months ago
- Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021 ☆39 Updated 3 years ago
- ☆51 Updated 3 years ago
- Single channel speech source separation by diffusion process (ICASSP 2023) ☆112 Updated last year
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic… ☆48 Updated 10 months ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w… ☆61 Updated 9 months ago
- ☆30 Updated 4 years ago
- AudioLDM training, finetuning, evaluation and inference. ☆15 Updated last year
- A Python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult… ☆37 Updated 9 months ago