salesforce / speech-datasets
Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard feature computations & data augmentations.
☆15Updated last year
Related projects
Alternatives and complementary repositories for speech-datasets
- Proposed splits for the LREC Wikipron paper☆13Updated 4 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.☆20Updated last year
- Fast and differentiable hidden Markov model in C++☆15Updated last year
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Updated 2 weeks ago
- Torch-based tool for quantizing high-dimensional vectors using additive codebooks☆51Updated 2 years ago
- Viterbi decoding in PyTorch☆27Updated last month
- Prosodic Speech Segmentation with Transformers☆23Updated 9 months ago
- Unofficial implementation of the WaveNeXt vocoder☆32Updated 2 months ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 2 years ago
- Speech in Flax/JAX☆15Updated 2 years ago
- A high-resolution implementation of the HiFi-GAN vocoder for voice conversion.☆30Updated last year
- Temporary anonymous version☆22Updated 8 months ago
- A collection of utilities for handling IPA phones.☆25Updated last year
- Implementation of Google's USM speech model in PyTorch☆25Updated 2 weeks ago
- A library of speech gadgets.☆13Updated 2 years ago
- A JAX library for building lattice-based speech transducer models☆40Updated last month
- A home for audio ML in JAX. Has common features, learnable frontends, pretrained supervised and self-supervised models.☆62Updated 2 years ago
- This repository contains all the code necessary for running the multilingual DistilWhisper from the Ferraz et al. 2024 IEEE ICASSP paper.☆18Updated 8 months ago
- Sequence alignment methods with helpers for PyTorch.☆24Updated last year
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Updated 3 years ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 9 months ago
- Text to Speech Synthesis based on controllable latent representation☆14Updated 5 years ago