Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"
☆3,227Jan 8, 2026Updated 3 months ago
Alternatives and similar repositories for Sonic
Users that are interested in Sonic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Sonic is a method about ' Shifting Focus to Global Audio Perception in Portrait Animation',you can use it in comfyUI☆1,132Sep 27, 2025Updated 6 months ago
- Taming Stable Diffusion for Lip Sync!☆5,569Jun 20, 2025Updated 9 months ago
- [CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation☆4,533Feb 23, 2026Updated last month
- DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portrai…☆299Aug 7, 2025Updated 8 months ago
- [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis☆1,624Jan 26, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning☆4,211Apr 7, 2026Updated last week
- MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting☆5,574Sep 26, 2025Updated 6 months ago
- [TMLR] Memory-Guided Diffusion for Expressive Talking Video Generation☆1,071Aug 6, 2025Updated 8 months ago
- Diffusion-based Portrait and Animal Animation☆861Dec 9, 2025Updated 4 months ago
- 一个超轻量级、可以在移动端实时运行的数字人模型☆2,463Sep 18, 2025Updated 6 months ago
- 🚀 Truly open-source AI avatar(digital human) toolkit for offline video generation and digital human cloning.☆12,701Oct 16, 2025Updated 6 months ago
- [CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer☆1,379Mar 13, 2025Updated last year
- talking-face video editing☆431Feb 27, 2025Updated last year
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆5,022Jul 2, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Real time interactive streaming digital human☆7,385Updated this week
- [CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/☆2,902Feb 19, 2025Updated last year
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment☆1,498Sep 11, 2025Updated 7 months ago
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance☆2,559Nov 18, 2025Updated 4 months ago
- [ACM MM 2025] Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis☆749Nov 12, 2025Updated 5 months ago
- [ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.☆469Nov 10, 2025Updated 5 months ago
- MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code☆819Oct 16, 2024Updated last year
- SkyReels V1: The first and most advanced open-source human-centric video foundation model☆2,669Mar 10, 2025Updated last year
- This node provides lip-sync capabilities in ComfyUI using ByteDance's LatentSync model. It allows you to synchronize video lips with audi…☆942Sep 4, 2025Updated 7 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- JoyHallo: Digital human model for Mandarin☆521Sep 21, 2025Updated 6 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆20,533Mar 16, 2026Updated last month
- Bring portraits to life in Real Time!onnx/tensorrt support!实时肖像驱动!☆1,089Jun 29, 2025Updated 9 months ago
- [CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"☆1,630Sep 18, 2025Updated 6 months ago
- Bring portraits to life!☆18,117Mar 2, 2026Updated last month
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,835Jun 28, 2024Updated last year
- ICCV 2025 ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control…☆450Aug 20, 2025Updated 7 months ago
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution☆384Jan 23, 2026Updated 2 months ago
- ☆3,306Dec 19, 2025Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.☆2,366Jan 24, 2025Updated last year
- HunyuanVideo: A Systematic Framework For Large Video Generation Model☆11,972Nov 21, 2025Updated 4 months ago
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers☆586Jun 5, 2025Updated 10 months ago
- ☆2,079Dec 16, 2025Updated 4 months ago
- [ECCV 2024 Oral] EDTalk - Official PyTorch Implementation☆462Sep 29, 2025Updated 6 months ago
- [CVPR2025] We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos…☆1,413Sep 21, 2025Updated 6 months ago
- [ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis☆1,250Mar 14, 2025Updated last year