πΈTTS recipes for different datasets
β88Jul 26, 2022Updated 3 years ago
Alternatives and similar repositories for TTS-recipes
Users that are interested in TTS-recipes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π Coqui's machine learning job schedulerβ31Sep 5, 2021Updated 4 years ago
- πΈ collection of TTS papersβ723Jul 4, 2024Updated last year
- Awesome stuff made by the Mycroft communityβ13Sep 16, 2021Updated 4 years ago
- Interface for using TTS and vocoder models in the form of a text editorβ19Nov 25, 2025Updated 4 months ago
- Evaluation of STT models for german languageβ15Jan 22, 2022Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- πΈSTT integration examplesβ130Sep 23, 2022Updated 3 years ago
- A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!β25Feb 17, 2025Updated last year
- Coqui Inference Engineβ40Aug 3, 2021Updated 4 years ago
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,390Jun 6, 2024Updated last year
- TTS Client for Coqui TTS serverβ13Jan 7, 2023Updated 3 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ26Mar 24, 2023Updated 3 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.β331Nov 15, 2024Updated last year
- A crash course for training speech recognition models using DeepSpeech.β24May 16, 2021Updated 4 years ago
- π€ Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillationβ262Nov 15, 2025Updated 4 months ago
- End-to-end encrypted cloud storage - Proton Drive β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license sβ¦β710Feb 2, 2026Updated last month
- π« check your data, before you wreck your modelβ16Aug 11, 2022Updated 3 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stackβ27Sep 23, 2022Updated 3 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modelingβ191Nov 18, 2021Updated 4 years ago
- Open models for Coqui STTβ153May 9, 2023Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.β13Feb 13, 2021Updated 5 years ago
- Using YouTube to prepare a speech recognition dataset for any languageβ10Mar 30, 2021Updated 5 years ago
- real-time speech enhanceβ17Jan 23, 2024Updated 2 years ago
- scipts for working with open.bible dataβ26Jan 24, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A library of speech gadgets.β14Oct 15, 2022Updated 3 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Modelβ107Sep 10, 2021Updated 4 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.β16Jul 22, 2021Updated 4 years ago
- β13Aug 7, 2021Updated 4 years ago
- Tensorflow implementation of "FloWaveNet: A Generative Flow for Raw Audio"β25Apr 19, 2019Updated 6 years ago
- Simple but maybe too simple config management through python data classes. We use it for machine learning.β108Apr 12, 2023Updated 2 years ago
- plugin manager for OpenVoiceOS , STT/TTS/Wakewords that can be used anywhereβ13Updated this week
- Mozilla deepspeech server implemented in django.β49Jun 10, 2021Updated 4 years ago
- German Tacotron 2 and Multi-band MelGAN in TensorFlow with TF Lite inference supportβ26Jun 7, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Coqui STT (πΈSTT) based forced alignment toolβ13Feb 24, 2022Updated 4 years ago
- Control-software for Quadrupedal Robots, especially for SpotMicroβ21May 27, 2021Updated 4 years ago
- A streaming Speech to Text server using DeepSpeechβ16May 10, 2020Updated 5 years ago
- https://wavelandspeech.github.io/β10Jan 12, 2024Updated 2 years ago
- Open Source Speech Inferencing Libary for Indic Languagesβ12Apr 11, 2022Updated 3 years ago
- Read audio with FFmpeg into NumPy/PyTorch via ctypes (standard library module)β11Aug 12, 2020Updated 5 years ago
- Linguistic processing for Common Voiceβ58Jan 18, 2024Updated 2 years ago