jim-schwoebel / voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
☆2,010 Updated last year
Alternatives and similar repositories for voice_datasets
Users who are interested in voice_datasets are comparing it to the repositories listed below.
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ☆1,357 Updated last year
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources. ☆1,797 Updated last month
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch ☆1,616 Updated last year
- Self-Supervised Speech Pre-training and Representation Learning Toolkit ☆2,452 Updated 3 months ago
- The PyTorch-based audio source separation toolkit for researchers ☆2,453 Updated last month
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subw… ☆995 Updated 3 months ago
- List of speech synthesis papers. ☆1,057 Updated 2 years ago
- g2p: English Grapheme To Phoneme Conversion ☆877 Updated 2 years ago
- CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender … ☆836 Updated 8 months ago
- This is a list of features, scripts, blogs and resources for better using Kaldi (http://kaldi-asr.org/) ☆537 Updated 3 years ago
- The Implementation of FastSpeech based on pytorch. ☆876 Updated 2 years ago
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab. ☆2,132 Updated last week
- SincNet is a neural architecture for efficiently processing raw audio samples. ☆1,196 Updated 4 years ago
- Simple text to phones converter for multiple languages ☆1,457 Updated 11 months ago
- Deep Speaker: an End-to-End Neural Speaker Embedding System. ☆933 Updated last year
- 🐸 collection of TTS papers