theMoro / DIRAugmentation
Improving Recording Device Generalization using Impulse Response Augmentation
☆10Updated last year
Related projects ⓘ
Alternatives and complementary repositories for DIRAugmentation
- ☆18Updated 2 years ago
- Learning differentiable temporal resolution on time-series data.☆33Updated 2 years ago
- Streaming Audiotransformers for online Audio tagging☆41Updated 5 months ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆30Updated 2 years ago
- experiments about AudioSet☆43Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆49Updated 2 weeks ago
- ☆26Updated last year
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆30Updated 3 months ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆36Updated 3 months ago
- EVAR ~ Evaluation package for Audio Representations☆43Updated 2 weeks ago
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆19Updated 11 months ago
- Fully Quantized Neural Networks For Speech Enhancement☆60Updated 9 months ago
- ☆68Updated 2 years ago
- ☆43Updated last year
- Multipurpose Multi Speaker Mixture Signal Generator☆43Updated last month
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆39Updated last year
- ☆14Updated last year
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆69Updated last month
- ☆20Updated last month
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆21Updated 8 months ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆27Updated 2 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆40Updated 3 years ago
- ☆27Updated 7 months ago
- Pytorch implementation of subband decomposition☆89Updated 2 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated last year
- Adapting a ConvNeXt model to audio classification on AudioSet☆19Updated last year
- ☆41Updated last year
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆29Updated last month