zsxkib / HunyuanVideo
☆29 · Updated 6 months ago
Alternatives and similar repositories for HunyuanVideo
Users who are interested in HunyuanVideo are comparing it to the repositories listed below.
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA] ☆98 · Updated 3 months ago
- ☆67 · Updated 3 months ago
- Sesame Converse - Real Time Conversations - Powered by Gemma 3 ☆63 · Updated 3 months ago
- ☆38 · Updated 5 months ago
- ☆70 · Updated 8 months ago
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation ☆205 · Updated last year
- A Gradio UI for XTTSv2 and RVC. ☆159 · Updated last year
- Slightly improved official version for finetune xtts ☆73 · Updated 9 months ago
- ☆65 · Updated 2 months ago
- SoTA open-source TTS for Audiobook and Podcast Generation ☆126 · Updated 3 weeks ago
- ☆117 · Updated 4 months ago
- A local implementation of the Kokoro Text-to-Speech model, featuring dynamic module loading, automatic dependency management, and a web i… ☆198 · Updated last month
- Webui for using XTTS and for finetuning it ☆117 · Updated 9 months ago
- Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use. ☆48 · Updated 4 months ago
- YuE: Open Full-song Generation Foundation for the GPU Poor ☆415 · Updated 5 months ago
- A TTS extension for oobabooga text WebUI ☆32 · Updated last year
- Adds "idle prompting" after the user has been idle for some time to organically continue the conversation. ☆18 · Updated 4 months ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F… ☆178 · Updated 3 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆50 · Updated 7 months ago
- The future of AI roleplay ☆116 · Updated last month
- ☆30 · Updated 6 months ago
- Automated speech dataset creator ☆155 · Updated last month
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆81 · Updated 9 months ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co… ☆54 · Updated 8 months ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on … ☆100 · Updated last week
- ⚡ AI Avatar Factory is an interface for creating and managing AI avatars. ⚡ ☆60 · Updated this week
- ☆32 · Updated last week
- Prompt-based Evolutionary Nudity Iteration System ☆129 · Updated 4 months ago
- Yet another self-hosted AI voice assistant. GlaDOS' blazing fast pipeline with Kokoro TTS voice and vision. ☆57 · Updated 5 months ago
- Examples of using the llasa-tts models locally ☆176 · Updated 2 months ago