nxneo / LatentsyncRealtimeLinks
This project provides a production-ready, real-time inference server for LatentSync, enabling high-quality, low-latency 2D digital human live streaming. It features a robust multi-process architecture, seamless idle/formal stream switching with smooth crossfade transitions, and is optimized for real-world deployment scenarios.
☆18Updated 2 months ago
Alternatives and similar repositories for LatentsyncRealtime
Users that are interested in LatentsyncRealtime are comparing it to the libraries listed below
Sorting:
- project page for ChatAnyone☆114Updated 6 months ago
- 基于通义千问 Qwen2.5-Omni 的实时语音对话系统,使用在线API服务,支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …☆76Updated 5 months ago
- 基于MuseTalk的数字人代码。☆31Updated last year
- Just a suturing monster project.☆41Updated last year
- 优化wav2lip的执行步骤,将头脸分离、嘴型替换、回补背景三个步骤分离,添加gfpgan强化面部功能,实现提前解帧,流式循环处理,对接obs☆79Updated 10 months ago
- 开源的LstmSync数字人泛化模型,只做最好的泛化模型!☆115Updated this week
- ☆78Updated 3 months ago
- 私有化自动数字人排队训练、短视频排队生成的微信小程序、web运营后台管理系统一键部署,基于单人训练的音频驱动唇形,比wav2lip、deepfacelab、liveportrait、musetalk等等唇形方案更好,直接可以商业化,支持中日英韩多种语音复刻☆50Updated 6 months ago
- python库,实现推送实时rtmp音视频流☆133Updated last year
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆172Updated 8 months ago
- ☆31Updated 2 months ago
- 通过此代码可以免训练模型并通过轻量级服务器定制数字人形象☆105Updated last year
- ☆42Updated last year
- 在DH_live项目基础上修改,添加webui界面☆71Updated 5 months ago
- ☆28Updated 2 years ago
- Web UI for OpenAvatarChat☆33Updated last month
- wav2lip384生成器网格权重——来自不蠢不蠢☆132Updated 7 months ago
- ☆372Updated 3 months ago
- 洛曦 数字人视频播放器,带HTTP API,使用gradio api对接Easy-Wav2Lip、Sadtalker、GeneFacePlusPlus、MuseTalk,也可以用于播放本地视频☆171Updated last year
- DH-Live-Web-UI☆18Updated last year
- ☆59Updated 2 years ago
- simple and fast wav2lip using onnx models for face-detection and inference. Easy installation☆25Updated last year
- Simple and fast wav2lip using new 256x256 resolution trained onnx-converted model for inference. Easy installation☆45Updated last year
- 主要写er-nerf从零到一所有部署过程☆43Updated last year
- ☆75Updated last year
- A docker free offline version for HeyGem; Python and Linux is all you need!☆347Updated 2 months ago
- EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation☆571Updated last month
- 基于中文文本情绪分析自动切换参考音频的 GPT-SoVITS 推理 Demo☆106Updated last year
- 一个基于多模态向量模型及视觉多模态模型构建的图片搜索引擎&管理系统,实现精准的以文搜文,文搜图、以图搜图多种智能检索方式。An image search engine management system built upon multimodal vector models…☆64Updated 3 weeks ago
- This is a project about talking faces. We use 576X576 sized facial images for training, which can generate 2k, 4k, 6k, and 8k digital hum…☆54Updated last year