Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.
☆2,810Jan 22, 2026Updated 2 months ago
Alternatives and similar repositories for supertonic
Users that are interested in supertonic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TTS model capable of streaming conversational audio in realtime.☆1,114Nov 29, 2025Updated 4 months ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion☆37Sep 9, 2025Updated 7 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆19,256Nov 19, 2025Updated 5 months ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆36Aug 30, 2025Updated 7 months ago
- ☆100Jan 19, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Soprano: Instant, Ultra-Realistic Text-to-Speech☆1,220Jan 15, 2026Updated 3 months ago
- Towards Human-Sounding Speech☆6,088Dec 5, 2025Updated 4 months ago
- A TTS that fits in your CPU (and pocket)☆3,926Apr 8, 2026Updated last week
- Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expres…☆7,188Mar 5, 2025Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆108Mar 15, 2026Updated last month
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report☆47Sep 2, 2025Updated 7 months ago
- State-of-the-art TTS model under 25MB 😻☆13,524Mar 27, 2026Updated 3 weeks ago
- https://hf.co/hexgrad/Kokoro-82M☆6,540Aug 6, 2025Updated 8 months ago
- Interface for OuteTTS models.☆1,430Mar 23, 2026Updated 3 weeks ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆107Jan 17, 2025Updated last year
- SoTA open-source TTS☆24,298Mar 26, 2026Updated 3 weeks ago
- SOTA Open Source TTS☆29,257Apr 6, 2026Updated last week
- On-device TTS model by Neuphonic☆5,147Mar 23, 2026Updated 3 weeks ago
- LEMAS‑TTS is a multilingual zero‑shot text‑to‑speech system, supporting 10 languages: Chinese English Spanish Russian French German Ital…☆98Mar 31, 2026Updated 2 weeks ago
- [ACL 2026 Main] Training, inference, and testing of the SAC speech codec model.☆100Nov 1, 2025Updated 5 months ago
- ☆458Nov 2, 2025Updated 5 months ago
- Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…☆91Apr 3, 2026Updated 2 weeks ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆6,238Aug 10, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆8,819Mar 26, 2026Updated 3 weeks ago
- Inference and training library for high-quality TTS models.☆5,565Dec 10, 2024Updated last year
- Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching☆953Dec 2, 2025Updated 4 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆14,337Updated this week
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!☆116Nov 24, 2025Updated 4 months ago
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆156Jan 27, 2026Updated 2 months ago
- A highly optimized engine for maya-1 tts model to generate minutes of audio in seconds.☆66Nov 17, 2025Updated 5 months ago
- Open-Source Frontier Voice AI☆39,575Updated this week
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆45,043Aug 16, 2024Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…☆11,658Updated this week
- Controllable and fast Text-to-Speech for over 7000 languages!☆2,198Jan 25, 2026Updated 2 months ago
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆10,010Mar 4, 2026Updated last month
- GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning☆970Apr 10, 2026Updated last week
- MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flex…☆1,271Mar 23, 2026Updated 3 weeks ago
- UTokyo-SaruLab MOS Prediction System☆309Apr 2, 2026Updated 2 weeks ago
- All generative model in one for better TTS model☆74Sep 8, 2024Updated last year