A high quality and fast TTS repository
☆511Dec 22, 2025Updated 5 months ago
Alternatives and similar repositories for MiraTTS
Users that are interested in MiraTTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fast audio super resolution from 16khz to 48khz.☆209Jan 3, 2026Updated 4 months ago
- Soprano: Instant, Ultra-Realistic Text-to-Speech☆1,234Jan 15, 2026Updated 4 months ago
- A highly compressive and high-quality neural audio codec for speech models.☆266Jan 23, 2026Updated 4 months ago
- Echo-TTS inference codebase☆189Dec 5, 2025Updated 5 months ago
- A lightning fast audio upsampler.☆768Feb 26, 2026Updated 3 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Create Unmute voice embeddings☆25Nov 15, 2025Updated 6 months ago
- Interface for OuteTTS models.☆1,432Mar 23, 2026Updated 2 months ago
- VLLM Port of the Chatterbox TTS model☆379Oct 18, 2025Updated 7 months ago
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!☆118Nov 24, 2025Updated 6 months ago
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆36Apr 29, 2025Updated last year
- Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…☆98May 18, 2026Updated last week
- A low-bitrate single-codebook 16 / 24 kHz speech codec based on focal modulation☆165Nov 30, 2025Updated 6 months ago
- ☆361Aug 28, 2025Updated 9 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A refeference of text models that can be used in the AI Horde☆12May 22, 2026Updated last week
- [ICLR2026] FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates☆49Apr 13, 2026Updated last month
- Soprano-Factory: Train your own 2000x realtime text-to-speech model☆230Jan 13, 2026Updated 4 months ago
- trying to reproduce suno v3☆34Jan 29, 2025Updated last year
- Example repo showcasing model training and deployment with distil claude cli skill☆55Jan 19, 2026Updated 4 months ago
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆49Feb 17, 2026Updated 3 months ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆314May 31, 2025Updated 11 months ago
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆24Sep 9, 2024Updated last year
- ☆101Jan 19, 2026Updated 4 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Realtime demo, Streaming and Finetuning code for CSM☆455Sep 17, 2025Updated 8 months ago
- ☆21Feb 14, 2026Updated 3 months ago
- Generate high resolution videos with a custom voice and appearance, based on LTX-2/LTX-2.3 + Identity In-Context LoRA☆303Mar 24, 2026Updated 2 months ago
- ☆18Sep 19, 2023Updated 2 years ago
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…☆20May 12, 2023Updated 3 years ago
- Wake word detection with custom phrases without model training☆47Mar 8, 2026Updated 2 months ago
- hentai game manager, mostly for f95, but might add other sites later☆12Nov 4, 2025Updated 6 months ago
- We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction☆207Apr 7, 2026Updated last month
- Towards Human-Sounding Speech☆6,148Dec 5, 2025Updated 5 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Large Scale Benchmark of Large Language Models on African Languages☆19Jul 28, 2025Updated 10 months ago
- Code for the blog "Neural audio codecs: how to get audio into LLMs"☆169Oct 20, 2025Updated 7 months ago
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆87Nov 12, 2024Updated last year
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆30Apr 16, 2024Updated 2 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆18May 31, 2023Updated 2 years ago
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆355Jul 21, 2025Updated 10 months ago
- VyvoTTS: LLM-Based Text-to-Speech Training Framework☆256Apr 8, 2026Updated last month