pyyush / SpecAugment
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆74Updated 4 years ago
Alternatives and similar repositories for SpecAugment:
Users that are interested in SpecAugment are comparing it to the libraries listed below
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆128Updated 2 years ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆142Updated last year
- ☆81Updated last year
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆89Updated 3 years ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆113Updated 5 months ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆66Updated 2 years ago
- PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)☆136Updated 2 years ago
- ☆63Updated 5 months ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆70Updated 2 years ago
- ☆29Updated 3 years ago
- This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.☆107Updated last year
- Repo associated to the DESED dataset, download and creation of data☆135Updated 7 months ago
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆213Updated last year
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053☆144Updated 2 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆39Updated 3 years ago
- Domestic environment sound event detection task☆138Updated 8 months ago
- A simple package for Guided source separation (GSS)☆114Updated 9 months ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆85Updated 6 months ago
- Clustering-based methods for overlapping diarization☆75Updated last year
- A PyTorch implementation of End-to-End Neural Diarization☆102Updated last year
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)☆139Updated 2 years ago
- Making Espnet easier to use☆54Updated 3 years ago
- Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.☆114Updated last year
- a simplified version of wav2vec(1.0, vq, 2.0) in fairseq☆139Updated 4 years ago
- A librosa STFT/Fbank/mfcc feature extration written up in PyTorch using 1D Convolutions.☆74Updated 2 years ago
- Source code for Consistent ensemble distillation for audio tagging☆24Updated 7 months ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆47Updated last month
- The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"☆115Updated 2 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆131Updated 2 months ago