anas-rz / specmix-pytorch
A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features
☆11Updated 2 years ago
Alternatives and similar repositories for specmix-pytorch:
Users that are interested in specmix-pytorch are comparing it to the libraries listed below
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Updated 2 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- Research code for the paper "Training speaker recognition systems with limited data" at https://arxiv.org/abs/2203.14688☆11Updated 4 months ago
- ☆19Updated last year
- ☆43Updated 2 years ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- Official repository of NeXt-TDNN for speaker verification☆70Updated 5 months ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆36Updated 8 months ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆39Updated 7 months ago
- Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch☆41Updated last year
- Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning☆22Updated last year
- Pytorch implementation of INTEGRATED PARAMETER-EFFICIENT TUNING FOR GENERAL-PURPOSE AUDIO MODELS☆10Updated last year
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆36Updated 9 months ago
- TODO☆37Updated last year
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆37Updated last year
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.☆27Updated 11 months ago
- ☆23Updated last year
- ☆30Updated last year
- ☆14Updated last year
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆51Updated last month
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆81Updated last year
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆76Updated 4 years ago
- Analysis of XLS-R for Speech Quality Assessment☆13Updated last month
- Clustering-based methods for overlapping diarization☆80Updated last year
- ☆30Updated 4 months ago
- EVAR ~ Evaluation package for Audio Representations☆49Updated 4 months ago
- An implementation for "Conformer: Convolution-augmented Transformer for Speech Recognition" Paper☆18Updated 2 years ago
- ☆65Updated last year
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆36Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago