Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard feature computations & data augmentations.
☆15Jun 12, 2023Updated 2 years ago
Alternatives and similar repositories for speech-datasets
Users that are interested in speech-datasets are comparing it to the libraries listed below
Sorting:
- Open Source Speech Inferencing Libary for Indic Languages☆13Apr 11, 2022Updated 3 years ago
- This is an unofficial implementation of universal melgan according to https://arxiv.org/abs/2011.09631☆23Aug 15, 2022Updated 3 years ago
- ☆32Jul 27, 2022Updated 3 years ago
- An SSH implemenation in pure Haskell☆17Feb 14, 2022Updated 4 years ago
- This repository provides an implementation of the DPCCN model for single-channel speech separation. More details will be updated soon.☆13Dec 8, 2021Updated 4 years ago
- Data from "Crowdsourcing of Parallel Corpora: the Case of Style Transfer for Detoxification" paper☆14Apr 3, 2025Updated 11 months ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20May 20, 2025Updated 9 months ago
- Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr…☆17Apr 27, 2023Updated 2 years ago
- ☆13Sep 21, 2022Updated 3 years ago
- A real time implementation of the ddsp from google magenta.☆15Nov 8, 2021Updated 4 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- A CSRankings-like index for speech researchers☆35Oct 16, 2024Updated last year
- A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…☆15Oct 13, 2022Updated 3 years ago
- Simple DSL that comiles to BPF assembly☆17Apr 19, 2018Updated 7 years ago
- Speech in Flax/JAX☆15Jul 11, 2022Updated 3 years ago
- An audio classification system for learning with out-of-distribution data☆33Dec 8, 2022Updated 3 years ago
- PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis☆69Aug 3, 2021Updated 4 years ago
- Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"☆17Feb 11, 2023Updated 3 years ago
- ☆18Apr 12, 2021Updated 4 years ago
- Efficient Speech Processing Tookit for Automatic Speaker Recognition☆17Feb 8, 2023Updated 3 years ago
- ☆46Nov 2, 2023Updated 2 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Jun 18, 2022Updated 3 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45May 25, 2021Updated 4 years ago
- Implementations of growing and pruning in neural networks☆22Jul 26, 2023Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- ☆17Aug 27, 2025Updated 6 months ago
- Haskell + nixpkgs = nix-hs☆24Jun 2, 2021Updated 4 years ago
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆24Sep 27, 2022Updated 3 years ago
- A simple universal data description format for datasets, tailored for interfacing with humans.☆25Feb 16, 2021Updated 5 years ago
- ☆26Nov 19, 2020Updated 5 years ago
- Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"☆192Dec 8, 2022Updated 3 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆21Jan 24, 2022Updated 4 years ago
- Simple reflection of expressions☆34Jun 18, 2021Updated 4 years ago
- ☆21Aug 29, 2019Updated 6 years ago
- nix scripts for pytorch-related libraries☆22Apr 25, 2021Updated 4 years ago
- Generating drum loops using the Wave-U-Net conditioned on intuitive parameters.☆24Nov 19, 2020Updated 5 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated 2 years ago
- ☆25Mar 12, 2022Updated 3 years ago