ishandutta2007 / Text-to-Speech-Landscape
☆44, updated 3 years ago
Alternatives and similar repositories for Text-to-Speech-Landscape:
Users who are interested in Text-to-Speech-Landscape are comparing it to the repositories listed below:
- Deep Convolution Text to Speech ☆35, updated 6 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model ☆107, updated 3 years ago
- A Collection of Speech Corpus for ASR and TTS ☆112, updated 7 years ago
- Phoneme prediction from speech mel-spectrograms using RNN. ☆13, updated 5 years ago
- 24-hour Automatic Speech Recognition ☆27, updated 3 years ago
- Multilingual Grapheme to Phoneme ☆49, updated 8 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference ☆30, updated 4 years ago
- Data processing tools for preparing speech and labels for training TTS voices ☆24, updated 4 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer ☆37, updated 2 years ago
- follow NVIDIA, simplify it and support data parallel. ☆13, updated 5 years ago
- Jupyter Notebooks for creating Speech datasets ☆46, updated 5 years ago
- Grapheme To Phoneme ☆70, updated 5 months ago
- flask+tornado based NVIDIA tacotron2+waveglow tts web app ☆28, updated last year
- asr2k ☆48, updated 7 months ago
- End-to-End Speech Recognition Using Tensorflow ☆42, updated last year
- Keras version of Syncnet, by Joon Son Chung and Andrew Zisserman. ☆51, updated 5 years ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources. ☆14, updated 5 years ago
- Implementation of Multi speaker TTS ☆50, updated 4 years ago
- Use your data to create a speech recognition system in Kaldi. Fast. ☆65, updated 5 years ago
- Integration of Fastspeech Text to Mel generation and fast Vocoder Squeezewave ☆20, updated last year
- Long audio alignment using Kaldi ☆24, updated 3 years ago
- Real-time melgan based on cpu !!! ☆13, updated 5 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models. ☆13, updated 4 years ago
- Forced Alignments for Common Voice ☆31, updated 4 years ago
- Python API for reading and querying ARPA formatted language models. ☆33, updated 10 years ago
- Pytorch implementation of Tacotron, a speech synthesis end-to-end generative TTS model. ☆29, updated 5 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆100, updated last year
- This repository contains all the codes used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat… ☆25, updated 5 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts ☆60, updated last year
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆63, updated 4 years ago