salesforce / speech-datasets
Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard feature computations & data augmentations.
☆15 Updated last year
Alternatives and similar repositories for speech-datasets:
Users who are interested in speech-datasets are comparing it to the libraries listed below.
- A JAX library for building lattice-based speech transducer models ☆41 Updated last month
- ☆22 Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated 11 months ago
- Source code for INTERSPEECH2020 ☆11 Updated 4 years ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need? ☆11 Updated last week
- Torch implementation of Whisper-guided DDPM based Voice Conversion ☆49 Updated last year
- ☆12 Updated 3 years ago
- Prosodic Speech Segmentation with Transformers ☆25 Updated 11 months ago
- ☆42 Updated 2 years ago
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model. ☆20 Updated last year
- Fast and differentiable hidden Markov model in C++ ☆16 Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆28 Updated last year
- Speech in Flax/JAX ☆15 Updated 2 years ago
- A collection of utilities for handling IPA phones. ☆26 Updated last year
- Implementation of Google's USM speech model in Pytorch ☆27 Updated this week
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021 ☆57 Updated 2 years ago
- ☆56 Updated 2 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z… ☆30 Updated 2 years ago
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M… ☆17 Updated 2 years ago
- A collection of papers related to speech model compression ☆24 Updated last year
- Pytorch Implementation of WaveNODE ☆64 Updated 4 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration ☆34 Updated 3 years ago
- A library of speech gadgets. ☆13 Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ☆20 Updated 10 months ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network ☆45 Updated 3 years ago
- A home for audio ML in JAX. Has common features, learnable frontends, pretrained supervised and self-supervised models. ☆65 Updated 2 years ago
- Viterbi decoding in PyTorch ☆27 Updated 3 months ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track. ☆13 Updated last year
- Temporary anonymous version ☆22 Updated 10 months ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆13 Updated 3 years ago