Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.
☆2,763Jan 22, 2026Updated 2 months ago
Alternatives and similar repositories for supertonic
Users that are interested in supertonic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TTS model capable of streaming conversational audio in realtime.☆1,104Nov 29, 2025Updated 4 months ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion☆36Sep 9, 2025Updated 6 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆19,220Nov 19, 2025Updated 4 months ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆36Aug 30, 2025Updated 7 months ago
- ☆100Jan 19, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Soprano: Instant, Ultra-Realistic Text-to-Speech☆1,207Jan 15, 2026Updated 2 months ago
- Towards Human-Sounding Speech☆6,037Dec 5, 2025Updated 3 months ago
- LEMAS‑TTS is a multilingual zero‑shot text‑to‑speech system, supporting 10 languages: Chinese English Spanish Russian French German Ital…☆94Jan 14, 2026Updated 2 months ago
- Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expres…☆7,198Mar 5, 2025Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆108Mar 15, 2026Updated 2 weeks ago
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report☆48Sep 2, 2025Updated 6 months ago
- A TTS that fits in your CPU (and pocket)☆3,620Mar 12, 2026Updated 2 weeks ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆107Jan 17, 2025Updated last year
- SOTA Open Source TTS☆28,887Updated this week
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- On-device TTS model by Neuphonic☆5,065Mar 23, 2026Updated last week
- https://hf.co/hexgrad/Kokoro-82M☆6,073Aug 6, 2025Updated 7 months ago
- SoTA open-source TTS☆23,922Mar 18, 2026Updated last week
- Trainging, inference, and testing of the SAC speech codec model.☆100Nov 1, 2025Updated 4 months ago
- State-of-the-art TTS model under 25MB 😻☆13,015Mar 19, 2026Updated last week
- Interface for OuteTTS models.☆1,431Mar 23, 2026Updated last week
- ☆453Nov 2, 2025Updated 4 months ago
- Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…☆88Feb 3, 2026Updated last month
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆8,581Updated this week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching☆929Dec 2, 2025Updated 3 months ago
- Inference and training library for high-quality TTS models.☆5,558Dec 10, 2024Updated last year
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆6,227Aug 10, 2024Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆14,254Updated this week
- Open-Source Frontier Voice AI☆24,019Updated this week
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!☆115Nov 24, 2025Updated 4 months ago
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆154Jan 27, 2026Updated 2 months ago
- A highly optimized engine for maya-1 tts model to generate minutes of audio in seconds.☆65Nov 17, 2025Updated 4 months ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆44,896Aug 16, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flex…☆1,218Mar 23, 2026Updated last week
- Controllable and fast Text-to-Speech for over 7000 languages!☆2,194Jan 25, 2026Updated 2 months ago
- VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning☆6,197Mar 13, 2026Updated 2 weeks ago
- GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning☆960Dec 17, 2025Updated 3 months ago
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…☆11,155Updated this week
- UTokyo-SaruLab MOS Prediction System☆305Feb 23, 2026Updated last month
- All generative model in one for better TTS model☆74Sep 8, 2024Updated last year