shangeth / wavencoder
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.
☆89Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for wavencoder
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 5 years ago
- ☆46Updated 4 years ago
- Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement☆73Updated 3 years ago
- fast SpecAugmentation code with numpy and scipy☆30Updated 5 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆64Updated 2 years ago
- Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.☆111Updated last year
- The implementation of "A Recursive Network with Dynamic Attention for Monaural Speech Enhancement"☆77Updated last year
- Domestic environment sound event detection task☆129Updated 5 months ago
- The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"☆115Updated 2 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆123Updated 2 years ago
- Pytorch: Channel-wise subband (CWS) input for better voice and accompaniment separation☆94Updated 3 years ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆52Updated last year
- Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)☆73Updated 4 years ago
- SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition☆111Updated 5 months ago
- Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)☆98Updated 4 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆48Updated 2 years ago
- Feature extractor for DL speech processing.☆65Updated 2 years ago
- MultiSV: scripts for data preparation☆25Updated last week
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆40Updated 3 years ago
- Speech Dereverberation using Fully Convolutional Networks☆68Updated 4 years ago
- Pytorch implementation of subband decomposition☆89Updated 2 years ago
- The codebase for Data-driven general-purpose voice activity detection.☆93Updated last year
- Baseline of DCASE 2020 task 4☆42Updated 2 years ago
- Constrained Permutation Invariant Training, Speech Separation☆43Updated 3 years ago
- A unofficial Pytorch implementation of Google's VoiceFilter☆98Updated last year
- ☆105Updated 3 years ago
- Repo associated to the DESED dataset, download and creation of data☆128Updated 4 months ago
- A personal toolkit for single/multi-channel speech recognition & enhancement & separation.☆139Updated last year
- ☆59Updated 4 years ago