nxneo / LatentsyncRealtimeLinks
This project provides a production-ready, real-time inference server for LatentSync, enabling high-quality, low-latency 2D digital human live streaming. It features a robust multi-process architecture, seamless idle/formal stream switching with smooth crossfade transitions, and is optimized for real-world deployment scenarios.
☆19Updated 2 months ago
Alternatives and similar repositories for LatentsyncRealtime
Users that are interested in LatentsyncRealtime are comparing it to the libraries listed below
Sorting:
- Just a suturing monster project.☆41Updated last year
- project page for ChatAnyone☆115Updated 7 months ago
- 优化wav2lip的执行步骤,将头脸分离、嘴型替换、回补背景三个步骤分离,添加gfpgan强化面部功能,实现提前解帧,流式循环处理,对接obs☆80Updated 10 months ago
- 基于通义千问 Qwen2.5-Omni 的实时语音对话系统,使用在线API服务,支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …☆78Updated 6 months ago
- 私有化自动数字人排队训练、短视频排队生成的微信小程序、web运营后台管理系统一键部署,基于单人训练的音频驱动唇形,比wav2lip、deepfacelab、liveportrait、musetalk等等唇形方案更好,直接可以商业化,支持中日英韩多种语音复刻☆52Updated 6 months ago
- Web UI for OpenAvatarChat☆43Updated 2 months ago
- DH-Live-Web-UI☆18Updated last year
- 开源的LstmSync数字人泛化模型,只做最好的泛化模型!☆120Updated last week
- python库,实现推送实时rtmp音视频流☆134Updated last year
- ☆385Updated 4 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆172Updated 9 months ago
- 在DH_live项目基础上修改,添加webui界面☆71Updated 6 months ago
- ☆42Updated last year
- ☆78Updated 3 months ago
- This is a project about talking faces. We use 576X576 sized facial images for training, which can generate 2k, 4k, 6k, and 8k digital hum…☆54Updated last year
- 基于MuseTalk的数字人代码。☆31Updated last year
- ☆28Updated 2 years ago
- ☆31Updated 3 months ago
- 通过此代码可以免训练模型并通过轻量级服务器定制数字人形象☆105Updated last year
- Realtime Video and Audio Streaming with WebRTC and Gradio☆73Updated 4 months ago
- ☆58Updated 2 years ago
- wav2lip384生成器网格权重——来自不蠢不蠢☆138Updated 8 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆106Updated last month
- 将Wav2Lip和GFPGAN进行结合实现高清数字人说话视频☆37Updated 5 months ago
- 洛曦 数字人视频播放器,带HTTP API,使用gradio api对接Easy-Wav2Lip、Sadtalker、GeneFacePlusPlus、MuseTalk,也可以用于播放本地视频☆172Updated last year
- 一个基于多模态向量模型及视觉多模态模型构建的图片搜索引擎&管理系统,实现精准的以文搜文,文搜图、以图搜图多种智能检索方式。An image search engine management system built upon multimodal vector models…☆68Updated last month
- 这是一个在wav2lip,使用wav2lip、gfpgan、yolov5等模型用RT加速的超快推理!经测试在2070显卡上可达到0.03秒每帧实现实时推理。☆31Updated last month
- 主要写er-nerf从零到一所有部署过程☆43Updated last year
- simple and fast wav2lip using onnx models for face-detection and inference. Easy installation☆25Updated last year
- Simple and fast wav2lip using new 256x256 resolution trained onnx-converted model for inference. Easy installation☆45Updated last year