shangeth / wavencoderLinks
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.
☆90Updated 3 years ago
Alternatives and similar repositories for wavencoder
Users that are interested in wavencoder are comparing it to the libraries listed below
Sorting:
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Updated last year
- fast SpecAugmentation code with numpy and scipy☆31Updated 5 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆41Updated 3 years ago
- ☆53Updated 5 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆140Updated 2 years ago
- The official repository for Audio ALBERT☆65Updated 3 years ago
- The codebase for Data-driven general-purpose voice activity detection.☆94Updated last year
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆60Updated 4 years ago
- COLA contrastive pre-training method implemented in PyTorch☆43Updated 4 years ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆66Updated 2 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆39Updated 4 years ago
- A PyTorch 1.0 implementation of the convolutions described in SincNet☆32Updated 6 years ago
- A new comprehensive and diverse few-shot acoustic classification benchmark.☆64Updated 8 months ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).☆103Updated last year
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆42Updated 5 months ago
- ☆29Updated 4 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Updated 2 years ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆41Updated 2 years ago
- ☆52Updated 4 years ago
- Baseline of DCASE 2020 task 4☆43Updated 2 years ago
- ☆65Updated 8 months ago
- Audio activity detector based on per-channel energy normalization (PCEN)☆29Updated 6 years ago
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆43Updated 3 years ago
- Implementation of audio degradation processes☆102Updated 9 years ago
- CP-JKU submission to DCASE 20☆44Updated 4 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 4 years ago
- Feature extractor for DL speech processing.☆65Updated 3 years ago
- Streaming Audiotransformers for online Audio tagging☆44Updated 11 months ago
- Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)☆74Updated 4 years ago