dongdongzi / metahuman-stream
Real-time streaming digital human based on NeRF
☆17Updated last year
Alternatives and similar repositories for metahuman-stream
Users who are interested in metahuman-stream are comparing it to the libraries listed below
- This is a project about talking faces. We use 576X576 sized facial images for training, which can generate 2k, 4k, 6k, and 8k digital hum…☆54Updated last year
- project page for ChatAnyone☆115Updated 8 months ago
- With this code you can customize a digital human avatar on a lightweight server without training a model☆106Updated last year
- ☆33Updated 9 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆172Updated 9 months ago
- Digital human code based on MuseTalk.☆31Updated last year
- Open-source LstmSync generalized digital human model, aiming to be the best generalized model!☆129Updated this week
- Just a suturing monster project.☆42Updated 2 years ago
- ChatTTS HTTP API☆54Updated last year
- ☆42Updated last year
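- An open-source agentic workflow framework/showcase that turns chat text into control actions, powered by the Agently AI application development framework.☆29Updated last year
- One-click deployment of a self-hosted WeChat mini program and web operations admin system for queued automatic digital human training and queued short-video generation. Audio-driven lip sync based on single-person training, which the project describes as better than lip-sync approaches such as wav2lip, deepfacelab, liveportrait, and musetalk; ready for commercial use; supports voice cloning in Chinese, Japanese, English, and Korean☆52Updated 7 months ago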
- Qwen-TTS offers a robust voice synthesis service using FastAPI, supporting bilingual and dialect options. Explore seamless audio generati…☆87Updated this week
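- Optimizes the wav2lip pipeline by splitting it into three separate steps (head/face separation, mouth replacement, and background restoration), adds gfpgan face enhancement, and implements ahead-of-time frame extraction, streaming loop processing, and OBS integration☆80Updated 11 months ago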
- The project page of Diffutoon☆28Updated last year
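- Chinese-language webui integrating OpenVoice and Melotts, with resemble_enhance audio enhancement added☆98Updated last year
- 洛曦 (Luoxi) digital human video player with an HTTP API; connects to Easy-Wav2Lip, Sadtalker, GeneFacePlusPlus, and MuseTalk through the gradio API, and can also play local videos☆173Updated last year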
- Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".☆194Updated 8 months ago
- simple and fast wav2lip using onnx models for face-detection and inference. Easy installation☆25Updated last year
- ☆41Updated last year
- ☆182Updated last week
- The API server version of the SadTalker project. Runs in Docker, 10 times faster than the original!☆144Updated 2 years ago
- This project fixes the Wav2Lip project so that it can run on Python 3.9. Wav2Lip is a project that can be used to lip-sync videos to audi…☆17Updated 2 years ago
- A simple API version of funasr speech-to-text (funasr + fastapi), easy to deploy on a server☆13Updated last year
- Tool for converting text corpora into training sets (txt to dataset)☆94Updated last year
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization☆222Updated 7 months ago
- ☆58Updated 2 years ago
- ☆51Updated 2 years ago
- [EMNLP 2025 Demo] PresentAgent: Multimodal Agent for Presentation Video Generation☆115Updated last week
- AI Emoji Argue Agent 🚀 An open-source, LangChain-based agent for meme battles☆27Updated last year