Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.
☆2,896May 6, 2026Updated this week
Alternatives and similar repositories for supertonic
Users that are interested in supertonic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TTS model capable of streaming conversational audio in realtime.☆1,120Nov 29, 2025Updated 5 months ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion☆37Sep 9, 2025Updated 8 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆19,294Nov 19, 2025Updated 5 months ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆36Aug 30, 2025Updated 8 months ago
- ☆101Jan 19, 2026Updated 3 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Towards Human-Sounding Speech☆6,127Dec 5, 2025Updated 5 months ago
- Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expres…☆7,197Mar 5, 2025Updated last year
- Soprano: Instant, Ultra-Realistic Text-to-Speech☆1,225Jan 15, 2026Updated 3 months ago
- A TTS that fits in your CPU (and pocket)☆4,109May 2, 2026Updated last week
- SoTA open-source TTS☆24,559May 1, 2026Updated last week
- https://hf.co/hexgrad/Kokoro-82M☆6,871Aug 6, 2025Updated 9 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆108Mar 15, 2026Updated last month
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report☆47Sep 2, 2025Updated 8 months ago
- State-of-the-art TTS model under 25MB 😻☆13,721Mar 27, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Interface for OuteTTS models.☆1,429Mar 23, 2026Updated last month
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆108Jan 17, 2025Updated last year
- SOTA Open Source TTS☆30,158Updated this week
- LEMAS‑TTS is a multilingual zero‑shot text‑to‑speech system, supporting 10 languages: Chinese English Spanish Russian French German Ital…☆99Mar 31, 2026Updated last month
- [ACL 2026 Main] Training, inference, and testing of the SAC speech codec model.☆101Nov 1, 2025Updated 6 months ago
- On-device TTS model by Neuphonic☆5,766Apr 24, 2026Updated 2 weeks ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆6,247Aug 10, 2024Updated last year
- ☆462Nov 2, 2025Updated 6 months ago
- Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…☆95Apr 3, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Inference and training library for high-quality TTS models.☆5,577Dec 10, 2024Updated last year
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆8,993Mar 26, 2026Updated last month
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆14,442Apr 20, 2026Updated 2 weeks ago
- UTokyo-SaruLab MOS Prediction System☆315Apr 2, 2026Updated last month
- Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching☆974Dec 2, 2025Updated 5 months ago
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!☆119Nov 24, 2025Updated 5 months ago
- Open-Source Frontier Voice AI☆46,529Apr 24, 2026Updated 2 weeks ago
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆159Jan 27, 2026Updated 3 months ago
- A highly optimized engine for maya-1 tts model to generate minutes of audio in seconds.☆66Nov 17, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…☆12,116Updated this week
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆45,208Aug 16, 2024Updated last year
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆10,111Apr 28, 2026Updated last week
- Controllable and fast Text-to-Speech for over 7000 languages!☆2,199Jan 25, 2026Updated 3 months ago
- MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flex…☆1,310Mar 23, 2026Updated last month
- GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning☆994Apr 10, 2026Updated 3 weeks ago
- All generative model in one for better TTS model☆74Sep 8, 2024Updated last year