klintan / swedish-asr-datasetLinks
Jupyter Notebooks for creating Speech datasets
☆46Updated 6 years ago
Alternatives and similar repositories for swedish-asr-dataset
Users that are interested in swedish-asr-dataset are comparing it to the libraries listed below
Sorting:
- Tensorflow Implementation of Expressive Tacotron☆196Updated 6 years ago
- Deep Convolution Text to Speech☆35Updated 7 years ago
- Cross-lingual Voice Conversion☆97Updated 7 years ago
- 24-hour Automatic Speech Recognition☆27Updated 4 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15Updated 5 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated 2 years ago
- A phoneme-allophone database for many languages☆52Updated 5 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45Updated 4 years ago
- scripts to align a given wave to its transcription using trained models by Kaldi☆32Updated 5 years ago
- Interface for Controllable Expressive Talking Machine☆38Updated last year
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆41Updated 2 years ago
- An implementation of Tacotron and Tacotron2☆81Updated 3 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆80Updated 5 years ago
- A Collection of Speech Corpus for ASR and TTS☆114Updated 8 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated last year
- pytorch tacotron2 https://arxiv.org/pdf/1712.05884.pdf☆43Updated 7 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42Updated 6 years ago
- Interspeech 2019 tutorial materials☆48Updated 5 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 4 years ago
- DeepMind's Tacotron-2 Tensorflow implementation☆34Updated 7 years ago
- A collection of utilities for handling IPA phones.☆25Updated last year
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Updated 5 years ago
- A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speech☆127Updated 4 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆81Updated last year
- Text-to-Speech tutorial at SLTU 2016☆35Updated 9 years ago
- A fast cnn-based vocoder☆78Updated 5 years ago
- ☆13Updated 6 years ago
- Tensor2tensor experiment with SpecAugment☆46Updated 6 years ago