anas-rz / specmix-pytorch
A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features
☆11Updated 2 years ago
Alternatives and similar repositories for specmix-pytorch
Users that are interested in specmix-pytorch are comparing it to the libraries listed below
Sorting:
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- ☆30Updated last year
- Streaming Audiotransformers for online Audio tagging☆44Updated 11 months ago
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆23Updated last year
- ☆26Updated last year
- ☆15Updated 2 years ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- ☆43Updated 2 years ago
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Updated 2 years ago
- Improving Recording Device Generalization using Impulse Response Augmentation☆16Updated 3 weeks ago
- A STFT/iSTFT written up in PyTorch using 1D Convolutions☆28Updated 10 months ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆39Updated 8 months ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆36Updated 9 months ago
- ☆19Updated 2 years ago
- TODO☆38Updated last year
- Framework for training and evaluating self-supervised learning methods for speaker verification.☆23Updated 2 months ago
- MSP-Podcast Challenge Baseline Code☆22Updated 11 months ago
- Pytorch implementation of INTEGRATED PARAMETER-EFFICIENT TUNING FOR GENERAL-PURPOSE AUDIO MODELS☆10Updated last year
- ☆65Updated last year
- Official repository of NeXt-TDNN for speaker verification☆71Updated 7 months ago
- Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch☆40Updated last year
- Clustering-based methods for overlapping diarization☆81Updated last year
- ☆26Updated 2 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated last year
- Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning☆22Updated last year
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆66Updated 2 years ago
- [Tiny VAD] SG-VAD: Stochastic Gates Based Speech Activity Detection☆29Updated last month
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆36Updated last year
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆33Updated 9 months ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆10Updated last year