jixiaozhong/Sonic

jixiaozhong / Sonic

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

☆3,266

Alternatives and similar repositories for Sonic

Users that are interested in Sonic are comparing it to the libraries listed below.

smthemex / ComfyUI_Sonic
View on GitHub
Sonic is a method for 'Shifting Focus to Global Audio Perception in Portrait Animation'; you can use it in ComfyUI.
☆1,141 · May 4, 2026 · Updated 2 months ago
antgroup / echomimic_v2
View on GitHub
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
☆4,623 · Feb 23, 2026 · Updated 5 months ago
bytedance / LatentSync
View on GitHub
Taming Stable Diffusion for Lip Sync!
☆5,931 · Jun 20, 2025 · Updated last year
toto222 / DICE-Talk
View on GitHub
DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portrai…
☆305 · Aug 7, 2025 · Updated 11 months ago
Fantasy-AMAP / fantasy-talking
View on GitHub
[ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
☆1,623 · Jan 26, 2026 · Updated 6 months ago
antgroup / echomimic
View on GitHub
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
☆4,275 · Apr 7, 2026 · Updated 3 months ago
memoavatar / memo
View on GitHub
[TMLR] Memory-Guided Diffusion for Expressive Talking Video Generation
☆1,069 · Aug 6, 2025 · Updated 11 months ago
TMElyralab / MuseTalk
View on GitHub
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
☆6,259 · Sep 26, 2025 · Updated 10 months ago
deepbrainai-research / float
View on GitHub
[ICCV 2025] Official PyTorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.
☆487 · Nov 10, 2025 · Updated 8 months ago
Tencent-Hunyuan / HunyuanVideo-Avatar
View on GitHub
☆2,140 · Dec 16, 2025 · Updated 7 months ago
fudan-generative-vision / hallo3
View on GitHub
[CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer
☆1,395 · Mar 13, 2025 · Updated last year
jdh-algo / JoyVASA
View on GitHub
Diffusion-based Portrait and Animal Animation
☆874 · Apr 16, 2026 · Updated 3 months ago
JOY-MM / JoyGen
View on GitHub
talking-face video editing
☆437 · Feb 27, 2025 · Updated last year
anliyuan / Ultralight-Digital-Human
View on GitHub
An ultra-lightweight digital human model that can run in real time on mobile devices
☆2,603 · Jul 22, 2026 · Updated last week
Phantom-video / Phantom
View on GitHub
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
☆1,512 · Sep 11, 2025 · Updated 10 months ago
duixcom / Duix-Avatar
View on GitHub
🚀 Truly open-source AI avatar (digital human) toolkit for offline video generation and digital human cloning.
☆14,217 · Apr 21, 2026 · Updated 3 months ago
Zejun-Yang / AniPortrait
View on GitHub
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
☆5,017 · Jul 2, 2024 · Updated 2 years ago
Saiyan-World / goku
View on GitHub
[CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/
☆2,909 · Feb 19, 2025 · Updated last year
lipku / LiveTalking
View on GitHub
Real time interactive streaming digital human
☆8,547 · Jul 19, 2026 · Updated last week
Tencent / MimicMotion
View on GitHub
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
☆2,632 · Nov 18, 2025 · Updated 8 months ago
MeiGen-AI / MultiTalk
View on GitHub
[NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation
☆2,976 · May 22, 2026 · Updated 2 months ago
antgroup / ditto-talkinghead
View on GitHub
[ACM MM 2025] Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis
☆846 · Nov 12, 2025 · Updated 8 months ago
Omni-Avatar / OmniAvatar
View on GitHub
☆1,851 · Aug 6, 2025 · Updated 11 months ago
SkyworkAI / SkyReels-V1
View on GitHub
SkyReels V1: The first and most advanced open-source human-centric video foundation model
☆2,692 · Mar 10, 2025 · Updated last year
yerfor / MimicTalk
View on GitHub
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code
☆825 · Oct 16, 2024 · Updated last year
jdh-algo / JoyHallo
View on GitHub
JoyHallo: Digital human model for Mandarin
☆520 · Sep 21, 2025 · Updated 10 months ago
Fantasy-AMAP / fantasy-portrait
View on GitHub
FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers
☆511 · Aug 20, 2025 · Updated 11 months ago
ShmuelRonen / ComfyUI-LatentSyncWrapper
View on GitHub
This node provides lip-sync capabilities in ComfyUI using ByteDance's LatentSync model. It allows you to synchronize video lips with audi…
☆957 · Sep 4, 2025 · Updated 10 months ago
warmshao / FasterLivePortrait
View on GitHub
Bring portraits to life in real time! ONNX/TensorRT support! Real-time portrait animation!
☆1,161 · Jun 29, 2025 · Updated last year
QwenAudio / CosyVoice
View on GitHub
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,464 · May 25, 2026 · Updated 2 months ago
ZiqiaoPeng / SyncTalk
View on GitHub
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
☆1,627 · Sep 18, 2025 · Updated 10 months ago
KlingAIResearch / LivePortrait
View on GitHub
Bring portraits to life!
☆18,827 · Jun 1, 2026 · Updated last month
HumanAIGC-Engineering / OpenAvatarChat
View on GitHub
☆3,655 · Jun 9, 2026 · Updated last month
TMElyralab / MuseV
View on GitHub
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
☆2,843 · Jun 28, 2024 · Updated 2 years ago
antonibigata / keysync
View on GitHub
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution
☆395 · Jan 23, 2026 · Updated 6 months ago
harlanhong / ACTalker
View on GitHub
ICCV 2025 ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control…
☆461 · Aug 20, 2025 · Updated 11 months ago
tencent-ailab / V-Express
View on GitHub
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
☆2,357 · Jan 24, 2025 · Updated last year
SkyworkAI / SkyReels-A1
View on GitHub
SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers
☆581 · Jun 5, 2025 · Updated last year
Tencent-Hunyuan / HunyuanVideo
View on GitHub
HunyuanVideo: A Systematic Framework For Large Video Generation Model
☆12,378 · Jun 29, 2026 · Updated last month