A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.
☆844Feb 14, 2026Updated 3 weeks ago
Alternatives and similar repositories for LuxTTS
Users who are interested in LuxTTS are comparing it to the libraries listed below.
- Inference server for MioTTS, a lightweight and fast LLM-based TTS model.☆103Feb 14, 2026Updated 3 weeks ago
- Fast audio super-resolution from 16 kHz to 48 kHz.☆199Jan 3, 2026Updated 2 months ago
- Soprano: Instant, Ultra-Realistic Text-to-Speech☆1,197Jan 15, 2026Updated last month
- A highly compressive and high-quality neural audio codec for speech models.☆257Jan 23, 2026Updated last month
- A Gradio-based web UI for voice cloning and voice design, powered by Qwen3-TTS & VibeVoice. Can use Whisper or VibeVoice-ASR for automat…☆340Updated this week
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using …☆307Dec 15, 2025Updated 2 months ago
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆41Feb 17, 2026Updated 3 weeks ago
- ☆454Nov 2, 2025Updated 4 months ago
- A lightning fast audio upsampler.☆737Feb 26, 2026Updated last week
- [CVPR 2026] 👋 Dataset and Benchmark code for EgoEdit☆111Feb 21, 2026Updated 2 weeks ago
- VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning☆6,078Mar 3, 2026Updated last week
- 🌋LavaSR: Fast Speech restoration and enhancement☆390Mar 2, 2026Updated last week
- [ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation☆306Nov 5, 2025Updated 4 months ago
- [INTERSPEECH 2025 Oral] Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"☆64Jun 16, 2025Updated 8 months ago
- ☆201Feb 3, 2026Updated last month
- [ICASSP'26] Real-time streaming voice anonymization & voice conversion☆57Feb 9, 2026Updated last month
- Audio-Visual Lip Synthesis via Intermediate Landmark Representation☆18May 16, 2023Updated 2 years ago
- Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching☆872Dec 2, 2025Updated 3 months ago
- [CVPR 2026] One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer☆442Feb 25, 2026Updated last week
- ☆280Jan 8, 2026Updated 2 months ago
- Vid Driven Portrait Animation 🤢😷☆18Jul 7, 2024Updated last year
- Soprano-Factory: Train your own 2000x realtime text-to-speech model☆211Jan 13, 2026Updated last month
- HeartMuLa Official Repo: The Most Powerful Open-Source Music Generation Model of 2026☆4,209Updated this week
- A TTS that fits in your CPU (and pocket)☆3,430Mar 1, 2026Updated last week
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars"☆41Nov 20, 2025Updated 3 months ago
- Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streamin…☆9,027Feb 6, 2026Updated last month
- Multi-AI documentation for OpenClaw: architecture, security audits, deployment guide☆135Updated this week
- Audiobook creation tool with support for multiple TTS models (Qwen3-TTS, MiraTTS, GLM-TTS, IndexTTS2, VibeVoice, Higgs V2, Fish S1-mini, …☆77Feb 27, 2026Updated last week
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆69Nov 1, 2024Updated last year
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- This suite of nodes unlocks high-performance parallel processing in ComfyUI by utilizing **Model Replication**. Unlike standard offloadin…☆41Feb 24, 2026Updated 2 weeks ago
- 🔍 Bug Bounty Search Engine - Advanced reconnaissance toolkit with 64+ Google dork queries organized into 10 categories for security rese…☆40Oct 6, 2025Updated 5 months ago
- ☆12Jun 17, 2019Updated 6 years ago
- Official Implementation of SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations☆880Feb 23, 2026Updated 2 weeks ago
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers☆503Aug 20, 2025Updated 6 months ago
- A Japanese G2P tool based on pyopenjtalk☆25Aug 6, 2022Updated 3 years ago
- Clean, polished interface for Tencent’s SongGeneration. Create songs from text prompts or reference audio, with batch processing and smar…☆363Jan 25, 2026Updated last month
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆659Jan 21, 2026Updated last month
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆78Nov 1, 2024Updated last year