JarodMica / chatterbox
SoTA open-source TTS
☆23Updated 5 months ago
Alternatives and similar repositories for chatterbox
Users who are interested in chatterbox are comparing it to the libraries listed below.
- ☆181Updated last year
- ☆289Updated 4 months ago
- Unofficial WIP LoRA Finetuning repository for VibeVoice☆283Updated 2 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆187Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆127Updated 4 months ago
- Fine Tune the Style-TTS2 Voice Model☆263Updated 5 months ago
- SoTA open-source TTS☆115Updated 6 months ago
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆634Updated 8 months ago
- Streaming and Fine-tuning for Chatterbox TTS☆229Updated 5 months ago
- Examples of using the llasa-tts models locally☆181Updated 7 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆45Updated 2 months ago
- VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)☆778Updated this week
- VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency☆174Updated last month
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆216Updated 7 months ago
- finetune llm part for spark-tts model☆112Updated 8 months ago
- This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi…☆213Updated 2 weeks ago
- Realtime demo, Streaming and Finetuning code for CSM☆421Updated 2 months ago
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆57Updated 6 months ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆255Updated last year
- A voice conversion tool project for Vietnamese speakers☆21Updated last week
- This app creates or reads Parquet datasets☆30Updated 7 months ago
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM☆291Updated 6 months ago
- ☆71Updated 8 months ago
- a Frontier Japanese Speech Generation net☆59Updated 6 months ago
- A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model that excels at editing emotion, speaking style, and paralinguistics…☆744Updated last week
- ☆338Updated 2 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆80Updated last year
- ☆250Updated 6 months ago
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆334Updated 4 months ago
- G2P☆368Updated 4 months ago