pyyush / SpecAugment
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆70 · Updated 4 years ago
Related projects
Alternatives and complementary repositories for SpecAugment
- Dataset and baseline code for the VocalSound dataset (ICASSP2022). ☆123 · Updated 2 years ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework ☆76 · Updated 3 months ago
- A library built for easier audio self-supervised training and downstream task evaluation ☆106 · Updated 2 months ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation". ☆140 · Updated last year
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a… ☆40 · Updated 3 years ago
- This repository surveys papers focusing on prompting and adapters for speech processing. ☆103 · Updated last year
- Repo associated with the DESED dataset: download and creation of data ☆127 · Updated 4 months ago
- Domestic environment sound event detection task ☆129 · Updated 5 months ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi… ☆36 · Updated 3 months ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi… ☆89 · Updated 3 years ago
- Source code for "Consistent ensemble distillation for audio tagging" ☆16 · Updated 4 months ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP… ☆59 · Updated 4 years ago
- A new comprehensive and diverse few-shot acoustic classification benchmark. ☆60 · Updated last month
- Code and data repository for the paper "VoxCeleb enrichment for Age and Gender recognition", submitted to ASRU 2021 ☆64 · Updated 2 years ago
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053 ☆143 · Updated 2 years ago
- Code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT) ☆35 · Updated 2 years ago
- Baseline of DCASE 2020 task 4 ☆42 · Updated 2 years ago
- Attention Backend for Automatic Speaker Verification with Multiple Enrollment Utterances ☆48 · Updated 2 years ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen ☆64 · Updated 2 years ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes). ☆98 · Updated last year
- Code for DCASE 2020 task 1a and task 1b. ☆85 · Updated 2 years ago
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised … ☆128 · Updated 10 months ago
- Clustering-based methods for overlapping diarization ☆70 · Updated 10 months ago
- A PyTorch implementation of End-to-End Neural Diarization ☆98 · Updated last year