nxneo / LatentsyncRealtimeLinks
This project provides a production-ready, real-time inference server for LatentSync, enabling high-quality, low-latency 2D digital human live streaming. It features a robust multi-process architecture, seamless idle/formal stream switching with smooth crossfade transitions, and is optimized for real-world deployment scenarios.
☆14Updated last week
Alternatives and similar repositories for LatentsyncRealtime
Users that are interested in LatentsyncRealtime are comparing it to the libraries listed below
Sorting:
- Just a suturing monster project.☆41Updated last year
- project page for ChatAnyone☆110Updated 5 months ago
- 基于通义千问 Qwen2.5-Omni 的实时语音对话系统,使用在线API服务,支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …☆63Updated 3 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆170Updated 6 months ago
- 私有化自动数字人排队训练、短视频排队生成的微信小程序、web运营后台管理系统一键部署,基于单人训练的音频驱动唇形,比wav2lip、deepfacelab、liveportrait、musetalk等等唇形方案更好,直接可以商业化,支持中日英韩多种语音复刻☆46Updated 4 months ago
- 开源的LstmSync数字人泛化模型,只做最好的泛化模型!☆87Updated this week
- 基于MuseTalk的数字人代码。☆31Updated 11 months ago
- ☆42Updated last year
- 优化wav2lip的执行步骤,将头脸分离、嘴型替换、回补背景三个步骤分离,添加gfpgan强化面部功能,实现提前解帧,流式循环处理,对接obs☆77Updated 8 months ago
- ☆28Updated last year
- ☆76Updated last month
- DH-Live-Web-UI☆18Updated 11 months ago
- python库,实现推送实时rtmp音视频流☆129Updated last year
- simple and fast wav2lip using onnx models for face-detection and inference. Easy installation☆25Updated 10 months ago
- Lightning-responsive CosyVoice2 streaming API based on FastAPI.☆15Updated 3 months ago
- Added vLLM support to IndexTTS for faster inference.☆424Updated 2 weeks ago
- ☆341Updated last month
- This is a project about talking faces. We use 576X576 sized facial images for training, which can generate 2k, 4k, 6k, and 8k digital hum…☆55Updated last year
- 通过此代码可以免训练模型并通过轻量级服务器定制数字人形象☆105Updated last year
- 在DH_live项目基础上修改,添加webui界面☆64Updated 4 months ago
- 基于wav2lip进行虚拟数字人训练,唇形驱动,包括数据处理流程等,模型包括96x96,192x192,192x288,288x288。☆20Updated last year
- 数字人授课录制系统——全新的微课视频的生成方案——API☆59Updated 7 months ago
- EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation☆317Updated last week
- DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portrai…☆245Updated 3 weeks ago
- wav2lip384生成器网格权重——来自不蠢不蠢☆120Updated 5 months ago
- 简单实现VAD+声纹锁+SenseVoice完成类语音实时转录的小项目☆34Updated 11 months ago
- Simple and fast wav2lip using new 256x256 resolution trained onnx-converted model for inference. Easy installation☆43Updated 10 months ago
- 洛曦 数字人视频播放器,带HTTP API,使用gradio api对接Easy-Wav2Lip、Sadtalker、GeneFacePlusPlus、MuseTalk,也可以用于播放本地视频☆168Updated 10 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆98Updated 11 months ago
- 使用FastAPI+vLLM部署Qwen2.5☆22Updated 10 months ago