theMoro / DIRAugmentation
Improving Recording Device Generalization using Impulse Response Augmentation
☆14 Updated last year
Alternatives and similar repositories for DIRAugmentation:
Users who are interested in DIRAugmentation are comparing it to the repositories listed below
- Streaming Audiotransformers for online Audio tagging ☆43 Updated 8 months ago
- ☆18 Updated 2 years ago
- Learning differentiable temporal resolution on time-series data. ☆35 Updated 2 years ago
- Official PyTorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023) ☆41 Updated last year
- PyTorch implementation of subband decomposition ☆92 Updated 2 years ago
- Fully Quantized Neural Networks For Speech Enhancement ☆61 Updated last year
- Inference code for PaSST, using the HEAR API. ☆31 Updated last year
- ☆22 Updated last year
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha… ☆33 Updated 6 months ago
- A PyTorch implementation of the paper: SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification ☆31 Updated 3 years ago
- ☆20 Updated last year
- ☆56 Updated 4 years ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge. ☆45 Updated last month
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging" ☆30 Updated 2 years ago
- ☆13 Updated 2 years ago
- This is the official implementation of the LiSenNet ☆55 Updated 3 months ago
- ☆64 Updated last year
- ☆69 Updated 2 years ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation ☆41 Updated 2 years ago
- ☆29 Updated 3 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement ☆47 Updated 2 years ago
- ☆48 Updated last year
- Prediction of sound event bounding boxes (SEBBs) ☆25 Updated 6 months ago
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE) ☆39 Updated last year
- ☆23 Updated 4 months ago
- ☆45 Updated 2 months ago
- ☆28 Updated 9 months ago
- Multipurpose Multi Speaker Mixture Signal Generator ☆44 Updated 2 weeks ago
- This repository is the official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR). ☆22 Updated 2 months ago
- NOMAD: Non-Matching Audio Distance (ICASSP 2024) ☆26 Updated 3 weeks ago