Alibaba-Quark / LiveAvatarLinks
Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"
☆1,750Updated last week
Alternatives and similar repositories for LiveAvatar
Users that are interested in LiveAvatar are comparing it to the libraries listed below
Sorting:
- PersonaLive! : Expressive Portrait Image Animation for Live Streaming☆1,612Updated last month
- HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning☆1,133Updated 2 weeks ago
- ☆2,024Updated last month
- ☆2,053Updated last month
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation☆1,204Updated 3 months ago
- [AAAI 2026] EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation☆755Updated this week
- [NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation☆2,794Updated last month
- [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis☆1,620Updated 2 weeks ago
- Official Code Repo for UniVA: Universal Video Agents☆343Updated 2 weeks ago
- "ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"☆2,239Updated last month
- ICCV 2025 ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control…☆444Updated 5 months ago
- ☆714Updated 3 months ago
- Qwen-Image-Layered: Layered Decomposition for Inherent Editablity☆1,540Updated last month
- Official code for StoryMem: Multi-shot Long Video Storytelling with Memory☆644Updated 3 weeks ago
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers☆582Updated 8 months ago
- We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while a…☆434Updated last month
- MagicTryOn is a video virtual try-on framework based on a large-scale video diffusion Transformer.☆512Updated 2 weeks ago
- [ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.☆457Updated 3 months ago
- Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.☆725Updated last month
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution☆376Updated 2 weeks ago
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers☆498Updated 5 months ago
- 🔥🔥 Open-sourced unified customization model☆1,201Updated 4 months ago
- [ICLR 26] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling☆1,961Updated 3 weeks ago
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using …☆293Updated last month
- ☆1,592Updated 2 months ago
- MoCha: End-to-End Video Character Replacement without Structural Guidance☆635Updated 3 weeks ago
- A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics…☆870Updated this week
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆672Updated 3 months ago
- [ACM MM 2025] Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis☆697Updated 2 months ago
- Diffusion-based Portrait and Animal Animation☆853Updated 2 months ago