thorstenMueller / Thorsten-VoiceLinks
Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling.
☆694Updated last week
Alternatives and similar repositories for Thorsten-Voice
Users that are interested in Thorsten-Voice are comparing it to the libraries listed below
Sorting:
- Automatic Speech Recognition (ASR) - German☆319Updated 2 years ago
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice …☆510Updated 2 years ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone☆1,054Updated last year
- Voice models for Mimic 3 text to speech system☆162Updated last year
- 🐸STT integration examples☆130Updated 3 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆332Updated last year
- 🐸 collection of TTS papers☆723Updated last year
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.☆844Updated 2 years ago
- Performant and accurate speech recognition built on Pytorch☆254Updated 3 years ago
- A fast local neural text to speech engine for Mycroft☆1,245Updated 10 months ago
- Docker image for Mozilla TTS server☆202Updated 2 years ago
- Grapheme to phoneme conversion with deep learning.☆420Updated 2 years ago
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.☆175Updated 2 years ago
- 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.☆2,558Updated last year
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆260Updated 2 months ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,384Updated last year
- A live speech recognition using Facebooks wav2vec 2.0 model.☆376Updated 2 years ago
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.☆586Updated 4 years ago
- Open Text to Speech Server☆1,119Updated last year
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆215Updated last year
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆569Updated 2 years ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Search☆701Updated 3 years ago
- Simple text to phones converter for multiple languages☆1,511Updated last year
- Official Implementation of StyleTTS☆460Updated last year
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆540Updated last year
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆358Updated 2 years ago
- Allosaurus is a pretrained universal phone recognizer for more than 2000 languages☆705Updated last year
- Pre-trained Precise models and training data provided by the Mycroft Community☆51Updated 4 years ago
- Finetune VITS and MMS using HuggingFace's tools☆191Updated last year
- The code for the bark-voicecloning model. Training and inference.☆710Updated 2 years ago