A high quality and fast TTS repository
☆512Dec 22, 2025Updated 5 months ago
Alternatives and similar repositories for MiraTTS
Users that are interested in MiraTTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fast audio super resolution from 16khz to 48khz.☆212Jan 3, 2026Updated 5 months ago
- Soprano: Instant, Ultra-Realistic Text-to-Speech☆1,235Jan 15, 2026Updated 5 months ago
- A highly compressive and high-quality neural audio codec for speech models.☆268Jan 23, 2026Updated 4 months ago
- Echo-TTS inference codebase☆194Dec 5, 2025Updated 6 months ago
- A lightning fast audio upsampler.☆776Feb 26, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Interface for OuteTTS models.☆1,433Mar 23, 2026Updated 2 months ago
- Create Unmute voice embeddings☆26Nov 15, 2025Updated 7 months ago
- VLLM Port of the Chatterbox TTS model☆377Oct 18, 2025Updated 8 months ago
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!☆119Nov 24, 2025Updated 6 months ago
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆36Apr 29, 2025Updated last year
- Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…☆100May 18, 2026Updated last month
- A low-bitrate single-codebook 16 / 24 kHz speech codec based on focal modulation☆168Nov 30, 2025Updated 6 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- A refeference of text models that can be used in the AI Horde☆12May 31, 2026Updated 2 weeks ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [ICLR2026] FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates☆49Apr 13, 2026Updated 2 months ago
- Soprano-Factory: Train your own 2000x realtime text-to-speech model☆232Jan 13, 2026Updated 5 months ago
- trying to reproduce suno v3☆34Jan 29, 2025Updated last year
- Example repo showcasing model training and deployment with distil claude cli skill☆54Jan 19, 2026Updated 5 months ago
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆49Feb 17, 2026Updated 4 months ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆313May 31, 2025Updated last year
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆24Sep 9, 2024Updated last year
- ☆101Jan 19, 2026Updated 5 months ago
- Realtime demo, Streaming and Finetuning code for CSM☆454Sep 17, 2025Updated 9 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Generate high resolution videos with a custom voice and appearance, based on LTX-2/LTX-2.3 + Identity In-Context LoRA☆323Mar 24, 2026Updated 2 months ago
- ☆18Sep 19, 2023Updated 2 years ago
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…☆20May 12, 2023Updated 3 years ago
- Wake word detection with custom phrases without model training☆54Mar 8, 2026Updated 3 months ago
- hentai game manager, mostly for f95, but might add other sites later☆13Updated this week
- We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction☆207Jun 11, 2026Updated last week
- Towards Human-Sounding Speech☆6,185Dec 5, 2025Updated 6 months ago
- Large Scale Benchmark of Large Language Models on African Languages☆19Jul 28, 2025Updated 10 months ago
- Code for the blog "Neural audio codecs: how to get audio into LLMs"☆172Oct 20, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆87Nov 12, 2024Updated last year
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆30Apr 16, 2024Updated 2 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆18May 31, 2023Updated 3 years ago
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆358Jul 21, 2025Updated 10 months ago
- VyvoTTS: LLM-Based Text-to-Speech Training Framework☆257Apr 8, 2026Updated 2 months ago
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report☆47Sep 2, 2025Updated 9 months ago
- ☆23Feb 14, 2026Updated 4 months ago