LLIA - Enabling Low-Latency Interactive Avatars: Real-Time Audio-Driven Portrait Video Generation with Diffusion Models
☆149Jun 11, 2025Updated 10 months ago
Alternatives and similar repositories for llia
Users that are interested in llia are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 内容审核及速率限制服务☆26May 18, 2025Updated 10 months ago
- DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portrai…☆299Aug 7, 2025Updated 8 months ago
- ☆63Dec 1, 2025Updated 4 months ago
- ☆166Dec 23, 2025Updated 3 months ago
- PersonaTalk Hack☆15Jan 10, 2025Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers☆502Aug 20, 2025Updated 7 months ago
- [ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.☆467Nov 10, 2025Updated 5 months ago
- A 2D customized lip-sync model for high-fidelity real-time driving.☆127Jun 26, 2025Updated 9 months ago
- [CVPR2025] KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation☆69Apr 8, 2025Updated last year
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers☆586Jun 5, 2025Updated 10 months ago
- [CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation☆285Mar 14, 2026Updated 3 weeks ago
- [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis☆1,624Jan 26, 2026Updated 2 months ago
- ☆25Dec 19, 2024Updated last year
- This is official inference code of PD-FGC☆100Oct 15, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution☆384Jan 23, 2026Updated 2 months ago
- TalkingMachines☆179Aug 2, 2025Updated 8 months ago
- [TPAMI2025] Code for my paper "Semi-Supervised Unconstrained Head Pose Estimation in the Wild"☆18Sep 25, 2025Updated 6 months ago
- Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars☆396Apr 8, 2025Updated last year
- speaker-disentangled speech linguistic content quantizer☆25Mar 19, 2025Updated last year
- ☆1,810Aug 6, 2025Updated 8 months ago
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder☆68Jul 21, 2024Updated last year
- The MAVD represents Mandarin Audio-Visual dataset with Depth information. MAVD has a rich variety of modal data, including audio, RGB ima…☆20Apr 22, 2024Updated last year
- [NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation☆2,885Dec 18, 2025Updated 3 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- (CVPR 26 Findings) Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-…☆34Sep 25, 2025Updated 6 months ago
- Bring portraits to life in Real Time!onnx/tensorrt support!实时肖像驱动!☆1,089Jun 29, 2025Updated 9 months ago
- An efficient distillation method for flow matching models☆25Feb 1, 2026Updated 2 months ago
- Offical implement of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for talking head Video Generation☆237Nov 12, 2025Updated 4 months ago
- [arXiv'25] AnyCharV: Bootstrap Controllable Character Video Generation with Fine-to-Coarse Guidance☆41Feb 19, 2025Updated last year
- This is the official source for our ACM MM 2023 paper "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking …☆143Dec 5, 2023Updated 2 years ago
- [CVPR 2025 Highlight] X-Dyna: Expressive Dynamic Human Image Animation☆266Jan 30, 2025Updated last year
- Drive your metahuman to speak within 1 second.☆11Mar 21, 2025Updated last year
- ☆435Jun 30, 2025Updated 9 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Efficient Long-duration Talking Video Synthesis with Linear Diffusion Transformer under Multimodal Guidance☆61Oct 20, 2025Updated 5 months ago
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search☆102Oct 3, 2025Updated 6 months ago
- Kaleido: Open-sourced multi-subject reference video generation model, enabling controllable, high-fidelity video synthesis from multiple …☆126Mar 2, 2026Updated last month
- SkyReels-A2: Compose anything in video diffusion transformers☆710Jun 3, 2025Updated 10 months ago
- RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space☆39Oct 16, 2025Updated 5 months ago
- [AAAI 2026] EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation☆855Mar 18, 2026Updated 3 weeks ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆11Sep 30, 2024Updated last year