ishandutta2007 / Text-to-Speech-Landscape

☆71

Alternatives and similar repositories for Text-to-Speech-Landscape

Users who are interested in Text-to-Speech-Landscape are comparing it to the repositories listed below.

Rumeysakeskin / Speaker-Verification
Verifying the identity of a person from characteristics of the voice independent from language via NVIDIA NeMo models (ECAPA-TDNN, Speake…
☆40 Updated 2 years ago
IEEE-NITK / Neural-Voice-Cloning
Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…
☆57 Updated 6 years ago
CSTR-Edinburgh / Ossian
☆58 Updated 6 years ago
Kyubyong / expressive_tacotron
TensorFlow Implementation of Expressive Tacotron
☆196 Updated 7 years ago
uiuc-sst / asr24
24-hour Automatic Speech Recognition
☆27 Updated 4 years ago
smoke-trees / Voice-synthesis
This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…
☆171 Updated 5 years ago
resemble-ai / MelNet
WIP: Open Source Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"
☆256 Updated 6 years ago
klintan / swedish-asr-dataset
Jupyter Notebooks for creating Speech datasets
☆46 Updated 6 years ago
aishoot / Multi-Hotword_Spotting
Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?
☆33 Updated 7 years ago
eazhary / dctts2
Deep Convolution Text to Speech
☆34 Updated 7 years ago
anjandeepsahni / speech_phoneme_prediction
Phoneme prediction from speech mel-spectrograms using RNN.
☆15 Updated 6 years ago
erogol / WaveRNN
PyTorch implementation of DeepMind's WaveRNN model
☆123 Updated 6 years ago
tiberiu44 / TTS-Cube
End-2-end speech synthesis with recurrent neural networks
☆224 Updated last year
ttaoREtw / Tacotron-pytorch
A PyTorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model
☆110 Updated 5 years ago
candlewill / Griffin_lim
A TensorFlow implementation of Griffin-Lim algorithm
☆79 Updated 7 years ago
Sharad24 / Neural-Voice-Cloning-with-Few-Samples
Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu
☆253 Updated 4 years ago
khuangaf / ITRI-speech-recognition-dataset-generation
Automatic Speech Recognition Dataset Generation
☆37 Updated 7 years ago
silenterus / deepspeech-cleaner
Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework
☆48 Updated 2 years ago
fatchord / FFTNet
PyTorch Implementation of FFTNet
☆87 Updated 7 years ago
akashmjn / cs224n-gpu-that-talks
Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)
☆52 Updated 6 years ago
nii-yamagishilab / multi-speaker-tacotron
VCTK multi-speaker tacotron for ICASSP 2020
☆266 Updated 3 years ago
jcsilva / multilingual-g2p
Multilingual Grapheme to Phoneme
☆50 Updated 9 years ago
AI4Bharat / NPTEL2020-Indian-English-Speech-Dataset
NPTEL2020: Speech2Text dataset for Indian-English Accent
☆80 Updated 4 years ago
candlewill / Speech-Corpus-Collection
A Collection of Speech Corpus for ASR and TTS
☆113 Updated 8 years ago
ynop / audiomate
Python library for handling audio datasets.
☆138 Updated 2 years ago
JohannesBuchner / spoken-command-recognition
A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable wo…
☆70 Updated 8 years ago
alokprasad / fastspeech_squeezewave
Integration of Fastspeech Text to Mel generation and fast Vocoder Squeezewave
☆20 Updated 2 years ago
Appen / UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
☆107 Updated 2 years ago
asappresearch / sew
☆76 Updated 4 years ago
CSTR-Edinburgh / magphase
MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.
☆80 Updated 6 years ago