denfed / wave-spec-fusionLinks
Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal Feature Fusion"
☆15Updated 3 years ago
Alternatives and similar repositories for wave-spec-fusion
Users that are interested in wave-spec-fusion are comparing it to the libraries listed below
Sorting:
- A Diffusion Probabilistic Model for Target Sound Extraction☆40Updated 9 months ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 3 years ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- ☆55Updated 2 months ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆34Updated 4 years ago
- ☆36Updated 2 months ago
- Paderborn Sound Event Detection☆74Updated 2 years ago
- Unofficial Pytorch Lightning Implementation of "A New Framework for CNN-Based Speech Enhancement in the Time Domain"☆20Updated 2 years ago
- ☆53Updated 2 years ago
- ☆56Updated last year
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆59Updated last month
- Discriminative Training of VBx Diarization☆25Updated 9 months ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆20Updated 3 years ago
- MultiSV: scripts for data preparation☆27Updated 6 months ago
- PyTorch implementation of LiMuSE☆31Updated 2 years ago
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆3Updated 4 months ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆12Updated last year
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆50Updated last year
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)☆62Updated 2 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆41Updated 3 years ago
- ☆18Updated 3 years ago
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…☆48Updated 9 months ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆54Updated 2 years ago
- 2022 DCASE Challenge☆12Updated 9 months ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆57Updated 10 months ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆60Updated 9 months ago
- ☆32Updated 2 years ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆37Updated 9 months ago
- ☆15Updated 3 years ago
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Updated last year