MycroftAI / mimic-recording-studio
Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be used to train a TTS voice with Mimic2.
☆500 · Updated last year
Related projects
Alternatives and complementary repositories for mimic-recording-studio
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito. ☆580 · Updated 3 years ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone ☆911 · Updated 2 weeks ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Search ☆667 · Updated 2 years ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning. ☆830 · Updated last year
- Performant and accurate speech recognition built on PyTorch ☆248 · Updated 2 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice. ☆200 · Updated 3 months ago
- Tacotron 2: PyTorch implementation with faster-than-realtime inference, modified to enable cross-lingual voice cloning. ☆358 · Updated last year
- 🐸 collection of TTS papers ☆640 · Updated 4 months ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ☆1,283 · Updated 5 months ago
- 🐸STT integration examples ☆121 · Updated 2 years ago
- Implementation of "Neural Voice Cloning With Few Samples" ☆430 · Updated 3 years ago
- Open tools and data for cloudless automatic speech recognition ☆443 · Updated 3 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages. ☆285 · Updated this week
- MelGAN vocoder (compatible with NVIDIA/tacotron2) ☆639 · Updated 4 years ago
- VCTK multi-speaker tacotron for ICASSP 2020 ☆265 · Updated 2 years ago
- Desktop application for neural speech synthesis written in C++ ☆210 · Updated last year
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t… ☆855 · Updated last year
- Voice models for Mimic 3 text to speech system ☆131 · Updated 5 months ago
- Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style tr… ☆890 · Updated last year
- An open-source text-to-speech (TTS) voice building tool ☆660 · Updated 3 months ago
- Unsupervised Speech Decomposition Via Triple Information Bottleneck ☆647 · Updated 3 weeks ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit… ☆167 · Updated 4 years ago
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time ☆339 · Updated last year
- Live speech recognition using Facebook's wav2vec 2.0 model. ☆328 · Updated 9 months ago
- Grapheme to phoneme conversion with deep learning. ☆358 · Updated 11 months ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis ☆1,960 · Updated 3 months ago
- Examples of how to use or integrate DeepSpeech ☆821 · Updated last year
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech ☆331 · Updated 2 years ago
- AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss ☆1,003 · Updated 3 weeks ago