LLIA - Enabling Low-Latency Interactive Avatars: Real-Time Audio-Driven Portrait Video Generation with Diffusion Models
☆148Jun 11, 2025Updated 8 months ago
Alternatives and similar repositories for llia
Users that are interested in llia are comparing it to the libraries listed below
Sorting:
- DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portrai…☆291Aug 7, 2025Updated 6 months ago
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers☆500Aug 20, 2025Updated 6 months ago
- [CVPR2025] KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation☆69Apr 8, 2025Updated 10 months ago
- 内容审核及速率限制服务☆26May 18, 2025Updated 9 months ago
- speaker-disentangled speech linguistic content quantizer☆24Mar 19, 2025Updated 11 months ago
- ☆63Dec 1, 2025Updated 3 months ago
- [ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.☆460Nov 10, 2025Updated 3 months ago
- [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis☆1,622Jan 26, 2026Updated last month
- ☆156Dec 23, 2025Updated 2 months ago
- A 2D customized lip-sync model for high-fidelity real-time driving.☆125Jun 26, 2025Updated 8 months ago
- [CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation☆283Feb 19, 2026Updated last week
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers☆584Jun 5, 2025Updated 8 months ago
- Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars☆395Apr 8, 2025Updated 10 months ago
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution☆376Jan 23, 2026Updated last month
- [TPAMI2025] Code for my paper "Semi-Supervised Unconstrained Head Pose Estimation in the Wild"☆18Sep 25, 2025Updated 5 months ago
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search☆100Oct 3, 2025Updated 4 months ago
- PersonaTalk Hack☆15Jan 10, 2025Updated last year
- ☆1,790Aug 6, 2025Updated 6 months ago
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder☆68Jul 21, 2024Updated last year
- This is official inference code of PD-FGC☆100Oct 15, 2023Updated 2 years ago
- Offical implement of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for talking head Video Generation☆237Nov 12, 2025Updated 3 months ago
- Bring portraits to life in Real Time!onnx/tensorrt support!实时肖像驱动!☆1,063Jun 29, 2025Updated 8 months ago
- [CVPR 2025 Highlight] X-Dyna: Expressive Dynamic Human Image Animation☆261Jan 30, 2025Updated last year
- Efficient Long-duration Talking Video Synthesis with Linear Diffusion Transformer under Multimodal Guidance☆61Oct 20, 2025Updated 4 months ago
- SkyReels-A2: Compose anything in video diffusion transformers☆704Jun 3, 2025Updated 8 months ago
- ☆19May 2, 2024Updated last year
- [ICML 2025] Playmate: Flexible Control of Portrait Animation via 3D-Implicit Space Guided Diffusion☆34Nov 10, 2025Updated 3 months ago
- [ACM MM 2025] Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis☆714Nov 12, 2025Updated 3 months ago
- This is the official source for our ACM MM 2023 paper "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking …☆143Dec 5, 2023Updated 2 years ago
- [NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation☆2,813Dec 18, 2025Updated 2 months ago
- The official code for paper: GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expressions…☆22Dec 12, 2024Updated last year
- A repo for generating random NFTs with metadata 100% on chain!☆37Mar 8, 2024Updated last year
- TalkingMachines☆178Aug 2, 2025Updated 7 months ago
- ☆25Dec 19, 2024Updated last year
- The MAVD represents Mandarin Audio-Visual dataset with Depth information. MAVD has a rich variety of modal data, including audio, RGB ima…☆20Apr 22, 2024Updated last year
- Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-based Embedding Rou…☆34Sep 25, 2025Updated 5 months ago
- RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space☆39Oct 16, 2025Updated 4 months ago
- [AAAI 2026] Minute-Long Videos with Dual Parallelisms☆46Nov 12, 2025Updated 3 months ago
- ☆421Jun 30, 2025Updated 8 months ago