LLIA - Enabling Low-Latency Interactive Avatars: Real-Time Audio-Driven Portrait Video Generation with Diffusion Models
☆148Jun 11, 2025Updated 9 months ago
Alternatives and similar repositories for llia
Users who are interested in llia are comparing it to the repositories listed below
- Content moderation and rate-limiting service☆26May 18, 2025Updated 10 months ago
- DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portrai…☆296Aug 7, 2025Updated 7 months ago
- ☆63Dec 1, 2025Updated 3 months ago
- ☆160Dec 23, 2025Updated 2 months ago
- PersonaTalk Hack☆15Jan 10, 2025Updated last year
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers☆503Aug 20, 2025Updated 7 months ago
- [ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.☆463Nov 10, 2025Updated 4 months ago
- A 2D customized lip-sync model for high-fidelity real-time driving.☆125Jun 26, 2025Updated 8 months ago
- [CVPR2025] KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation☆69Apr 8, 2025Updated 11 months ago
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers☆584Jun 5, 2025Updated 9 months ago
- [CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation☆284Mar 14, 2026Updated last week
- [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis☆1,623Jan 26, 2026Updated last month
- ☆25Dec 19, 2024Updated last year
- This is the official inference code of PD-FGC☆100Oct 15, 2023Updated 2 years ago
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution☆380Jan 23, 2026Updated last month
- TalkingMachines☆179Aug 2, 2025Updated 7 months ago
- [TPAMI2025] Code for my paper "Semi-Supervised Unconstrained Head Pose Estimation in the Wild"☆18Sep 25, 2025Updated 5 months ago
- Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars☆396Apr 8, 2025Updated 11 months ago
- speaker-disentangled speech linguistic content quantizer☆24Mar 19, 2025Updated last year
- ☆1,806Aug 6, 2025Updated 7 months ago
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder☆68Jul 21, 2024Updated last year
- The MAVD represents Mandarin Audio-Visual dataset with Depth information. MAVD has a rich variety of modal data, including audio, RGB ima…☆20Apr 22, 2024Updated last year
- [NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation☆2,840Dec 18, 2025Updated 3 months ago
- (CVPR 26 Findings) Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-…☆34Sep 25, 2025Updated 5 months ago
- Bring portraits to life in real time! onnx/tensorrt support! Real-time portrait animation!☆1,077Jun 29, 2025Updated 8 months ago
- An efficient distillation method for flow matching models☆25Feb 1, 2026Updated last month
- Official implementation of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for talking head Video Generation☆239Nov 12, 2025Updated 4 months ago
- [arXiv'25] AnyCharV: Bootstrap Controllable Character Video Generation with Fine-to-Coarse Guidance☆41Feb 19, 2025Updated last year
- [CVPR 2025 Highlight] X-Dyna: Expressive Dynamic Human Image Animation☆264Jan 30, 2025Updated last year
- This is the official source for our ACM MM 2023 paper "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking …☆144Dec 5, 2023Updated 2 years ago
- ☆427Jun 30, 2025Updated 8 months ago
- Drive your metahuman to speak within 1 second.☆11Mar 21, 2025Updated last year
- Efficient Long-duration Talking Video Synthesis with Linear Diffusion Transformer under Multimodal Guidance☆61Oct 20, 2025Updated 5 months ago
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search☆101Oct 3, 2025Updated 5 months ago
- Kaleido: Open-sourced multi-subject reference video generation model, enabling controllable, high-fidelity video synthesis from multiple …☆123Mar 2, 2026Updated 2 weeks ago
- SkyReels-A2: Compose anything in video diffusion transformers☆706Jun 3, 2025Updated 9 months ago
- [AAAI 2026] EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation☆819Updated this week
- RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space☆39Oct 16, 2025Updated 5 months ago
- The homepage of LongCat-Video-Avatar☆167Dec 18, 2025Updated 3 months ago