fagenorn / handcrafted-persona-engineLinks
An AI-powered interactive avatar engine using Live2D, LLM, ASR, TTS, and RVC. Ideal for VTubing, streaming, and virtual assistant applications.
β764Updated last month
Alternatives and similar repositories for handcrafted-persona-engine
Users that are interested in handcrafted-persona-engine are comparing it to the libraries listed below
Sorting:
- ππ§Έ A container of souls of AI waifu / virtual characters to bring them into our worlds, wishing to achieve Neuro-sama's altitude, complβ¦β826Updated this week
- A Fast TTS Engineβ502Updated 4 months ago
- β400Updated 2 weeks ago
- Interface for OuteTTS models.β1,283Updated last week
- InspireMusic: Music, Song, Audio Generation.β1,107Updated last week
- π§Έ Lobe Vidol - Making Virtual Idols Accessible for EveryOneβ742Updated 2 months ago
- DiβͺβͺRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusionβ1,671Updated last week
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.β408Updated last month
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.β227Updated 4 months ago
- An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech Systemβ2,259Updated this week
- Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabsβ842Updated 2 weeks ago
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,β¦β367Updated 2 weeks ago
- β1,129Updated this week
- Self-hosted voice chat with LLMsβ431Updated 3 months ago
- ACE-Step: A Step Towards Music Generation Foundation Modelβ2,288Updated last week
- β875Updated last month
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generationβ962Updated 2 weeks ago
- AnimeGamer: Infinite Anime Life Simulation with Next Game State Predictionβ315Updated last month
- Dive is an open-source MCP Host Desktop Application that seamlessly integrates with any LLMs supporting function calling capabilities. β¨β1,302Updated this week
- Generate Web Pages and Components with text prompts, with Local Models. (or Cloud Models, if you want) - now supports Thinking Models!β155Updated 3 weeks ago
- zero-shot voice conversion & singing voice conversion, with real-time supportβ2,544Updated last month
- β377Updated 3 weeks ago
- Local SRT/LLM/TTS Voicechatβ680Updated 7 months ago
- Run Orpheus 3B Locally With LM Studioβ414Updated 2 months ago
- Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), supporβ¦β218Updated this week
- Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple modelsβ387Updated last month
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YTβ344Updated last month
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)β264Updated last month
- Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video geβ¦β510Updated 3 weeks ago
- Sesame CSM 1B Voice Cloningβ298Updated 2 months ago