LLIA - Enabling Low-Latency Interactive Avatars: Real-Time Audio-Driven Portrait Video Generation with Diffusion Models
☆150Jun 11, 2025Updated 10 months ago
Alternatives and similar repositories for llia
Users that are interested in llia are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 内容审核及速率限制服务☆26May 18, 2025Updated 11 months ago
- DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portrai…☆300Aug 7, 2025Updated 8 months ago
- ☆63Dec 1, 2025Updated 5 months ago
- PersonaTalk Hack☆15Jan 10, 2025Updated last year
- ☆177Dec 23, 2025Updated 4 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers☆506Aug 20, 2025Updated 8 months ago
- [ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.☆472Nov 10, 2025Updated 5 months ago
- A 2D customized lip-sync model for high-fidelity real-time driving.☆129Jun 26, 2025Updated 10 months ago
- [CVPR2025] KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation☆70Apr 8, 2025Updated last year
- [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis☆1,627Jan 26, 2026Updated 3 months ago
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers☆586Jun 5, 2025Updated 10 months ago
- [CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation☆284Mar 14, 2026Updated last month
- ☆25Dec 19, 2024Updated last year
- This is official inference code of PD-FGC☆100Oct 15, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution☆384Jan 23, 2026Updated 3 months ago
- ☆1,815Aug 6, 2025Updated 8 months ago
- TalkingMachines☆179Aug 2, 2025Updated 9 months ago
- [TPAMI2025] Code for my paper "Semi-Supervised Unconstrained Head Pose Estimation in the Wild"☆19Sep 25, 2025Updated 7 months ago
- Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars☆397Apr 8, 2025Updated last year
- speaker-disentangled speech linguistic content quantizer☆25Mar 19, 2025Updated last year
- Bring portraits to life in Real Time!onnx/tensorrt support!实时肖像驱动!☆1,102Jun 29, 2025Updated 10 months ago
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder☆69Jul 21, 2024Updated last year
- The MAVD represents Mandarin Audio-Visual dataset with Depth information. MAVD has a rich variety of modal data, including audio, RGB ima…☆20Apr 22, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation☆2,910Dec 18, 2025Updated 4 months ago
- (CVPR 26 Findings) Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-…☆34Apr 7, 2026Updated 3 weeks ago
- An efficient distillation method for flow matching models☆26Feb 1, 2026Updated 3 months ago
- Offical implement of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for talking head Video Generation☆237Nov 12, 2025Updated 5 months ago
- [arXiv'25] AnyCharV: Bootstrap Controllable Character Video Generation with Fine-to-Coarse Guidance☆41Feb 19, 2025Updated last year
- This is the official source for our ACM MM 2023 paper "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking …☆143Dec 5, 2023Updated 2 years ago
- [CVPR 2025 Highlight] X-Dyna: Expressive Dynamic Human Image Animation☆267Jan 30, 2025Updated last year
- Drive your metahuman to speak within 1 second.☆11Mar 21, 2025Updated last year
- ☆442Jun 30, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search☆104Oct 3, 2025Updated 6 months ago
- Efficient Long-duration Talking Video Synthesis with Linear Diffusion Transformer under Multimodal Guidance☆61Oct 20, 2025Updated 6 months ago
- Kaleido: Open-sourced multi-subject reference video generation model, enabling controllable, high-fidelity video synthesis from multiple …☆130Mar 2, 2026Updated 2 months ago
- SkyReels-A2: Compose anything in video diffusion transformers☆711Jun 3, 2025Updated 10 months ago
- RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space☆39Oct 16, 2025Updated 6 months ago
- Official repo for FaceShot: Bring Any Character into Life☆82Jun 30, 2025Updated 10 months ago
- [AAAI 2026] EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation☆885Mar 18, 2026Updated last month