[NeurIPS 2025] OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication
☆418Sep 19, 2025Updated 5 months ago
Alternatives and similar repositories for omnitalker
Users that are interested in omnitalker are comparing it to the libraries listed below
Sorting:
- project page for ChatAnyone☆116Mar 28, 2025Updated 11 months ago
- ☆3,185Dec 19, 2025Updated 2 months ago
- [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis☆1,622Jan 26, 2026Updated last month
- 🚀 Truly open-source AI avatar(digital human) toolkit for offline video generation and digital human cloning.☆12,392Oct 16, 2025Updated 4 months ago
- ☆421Jun 30, 2025Updated 8 months ago
- Taming Stable Diffusion for Lip Sync!☆5,441Jun 20, 2025Updated 8 months ago
- Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"☆3,193Jan 8, 2026Updated last month
- MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting☆5,366Sep 26, 2025Updated 5 months ago
- [CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation☆4,490Feb 23, 2026Updated last week
- ICCV 2025 ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control…☆445Aug 20, 2025Updated 6 months ago
- [ACM MM 2025] Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis☆714Nov 12, 2025Updated 3 months ago
- 每个人都能用的数字人☆1,854Nov 8, 2025Updated 3 months ago
- Real time interactive streaming digital human☆7,146Feb 24, 2026Updated last week
- [CVPR 2025] This is the official source for our paper "DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations"☆55Jul 12, 2025Updated 7 months ago
- [AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning☆4,185Aug 5, 2025Updated 6 months ago
- 一个超轻量级、可以在移动端实时运行的数字人模型☆2,421Sep 18, 2025Updated 5 months ago
- 内容审核及速率限制服务☆26May 18, 2025Updated 9 months ago
- [SIGGRAPH 2025] LAM: Large Avatar Model for One-shot Animatable Gaussian Head☆929Sep 11, 2025Updated 5 months ago
- [ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.☆460Nov 10, 2025Updated 3 months ago
- MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code☆811Oct 16, 2024Updated last year
- talking-face video editing☆423Feb 27, 2025Updated last year
- The official SpeakerVid-5M data curation code.☆68Jul 23, 2025Updated 7 months ago
- [AAAI 2026] EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation☆785Updated this week
- SkyReels V1: The first and most advanced open-source human-centric video foundation model☆2,654Mar 10, 2025Updated 11 months ago
- Unlimited-length talking video generation that supports image-to-video and video-to-video generation☆4,870Dec 18, 2025Updated 2 months ago
- Spark-TTS Inference Code☆10,943Apr 9, 2025Updated 10 months ago
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution☆376Jan 23, 2026Updated last month
- Memory-Guided Diffusion for Expressive Talking Video Generation☆1,073Aug 6, 2025Updated 6 months ago
- ☆486May 6, 2025Updated 9 months ago
- ☆6,072Aug 29, 2025Updated 6 months ago
- ☆4,613Feb 13, 2026Updated 2 weeks ago
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment☆1,488Sep 11, 2025Updated 5 months ago
- 实时交互数字人,可自定义形象与音色,支持音色克隆,对话延迟低至3s。Real-time voice interactive digital human, customizable appearance and voice, supporting voice cloning,…☆1,208Dec 18, 2025Updated 2 months ago
- AIGCPanel 是一个简单易用的一站式AI数字人系统,支持视频合成、声音合成、声音克隆,简化本地模型管理、一键导入和使用AI模型。☆4,614Feb 7, 2026Updated 3 weeks ago
- ☆17Sep 5, 2024Updated last year
- Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LL…☆3,141Feb 10, 2026Updated 3 weeks ago
- ☆657Nov 18, 2025Updated 3 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆19,695Feb 11, 2026Updated 2 weeks ago
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers☆584Jun 5, 2025Updated 8 months ago