paperwave / VibeVoice
Frontier Open-Source Text-to-Speech
☆95Updated 3 months ago
Alternatives and similar repositories for VibeVoice
Users who are interested in VibeVoice are comparing it to the libraries listed below.
- Unofficial WIP LoRA Finetuning repository for VibeVoice☆290Updated 2 months ago
- A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model that excels at editing emotion, speaking style, and paralinguistics…☆786Updated last week
- The official code repository for SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement☆690Updated 2 weeks ago
- ☆532Updated 2 months ago
- ☆374Updated last month
- VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)☆870Updated last week
- BeltOut: An open source pitch-perfect voice-to-voice timbre transfer model based on ChatterboxVC☆78Updated 5 months ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]☆105Updated last month
- ComfyUI node for highly expressive speech and realistic zero-shot voice cloning☆350Updated this week
- vLLM Port of the Chatterbox TTS model☆351Updated 2 months ago
- ☆472Updated 7 months ago
- ☆135Updated 9 months ago
- SoTA open-source TTS for Audiobook and Podcast Generation☆176Updated 6 months ago
- ☆289Updated 4 months ago
- AudioStory: Generating Long-Form Narrative Audio with Large Language Models☆291Updated 3 months ago
- Examples of using the llasa-tts models locally☆182Updated 8 months ago
- Gradio UI for YuE☆84Updated 8 months ago
- YuE with mp3 extend, exllama and GUI☆64Updated 9 months ago
- The official code repository for LeVo: High-Quality Song Generation with Multi-Preference Alignment☆1,002Updated last week
- ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio☆536Updated 2 months ago
- SoTA open-source TTS☆136Updated this week
- GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning☆722Updated this week
- [ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation☆289Updated last month
- A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Step Audio EditX, I…☆470Updated this week
- Streaming and Fine-tuning for Chatterbox TTS☆237Updated 6 months ago
- ☆701Updated last month
- A high quality and fast TTS repository☆111Updated this week
- GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters☆570Updated last week
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer☆324Updated last week
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆298Updated 2 months ago