index-tts / index-tts2.github.ioLinks
The showcase page of IndexTTS2
☆47Updated 2 weeks ago
Alternatives and similar repositories for index-tts2.github.io
Users that are interested in index-tts2.github.io are comparing it to the libraries listed below
Sorting:
- Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible),…☆380Updated this week
- LLM voice chat project by Connect ChatTTS with Local Ollama, 连接本地部署的 Ollama 和 ChatTTS,实现和LLM的语音对话☆62Updated 11 months ago
- ☆441Updated last month
- MagicTryOn is a video virtual try-on framework based on a large-scale video diffusion Transformer.☆382Updated this week
- ☆500Updated 3 weeks ago
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution☆337Updated last month
- Have a natural voice conversation with an LLM☆250Updated 7 months ago
- Modified version of Chatterbox that accepts text files as input and no character restrictions☆317Updated 2 weeks ago
- MaskGCT-Windows For Windows Users☆64Updated last month
- [CVPR 2025] This is an official inference code of the paper "BizGen: Advancing Article-level Visual Text Rendering for Infographics Gener…☆286Updated 3 months ago
- A Fast TTS Engine☆525Updated 5 months ago
- ☆426Updated 2 months ago
- ☆38Updated 5 months ago
- ☆259Updated 10 months ago
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆232Updated 5 months ago
- project page for ChatAnyone☆111Updated 3 months ago
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization☆215Updated 3 months ago
- Service for testing out the new Qwen2.5 omni model☆54Updated 2 months ago
- AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast☆73Updated last week
- In-context subject-driven image generation while preserving foreground fidelity☆297Updated last month
- Official repository of "TryOffAnyone: Tiled Cloth Generation from a Dressed Person"☆177Updated 5 months ago
- DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portrai…☆222Updated 2 months ago
- ☆101Updated this week
- Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait☆269Updated last month
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.☆448Updated 8 months ago
- LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)☆239Updated last week
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]☆98Updated 3 months ago
- Command-line personal assistant using your favorite proprietary or local models with access to over 30+ tools☆110Updated 3 weeks ago
- Kyutai with an "eye"☆207Updated 3 months ago
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆274Updated 3 months ago