zhangzjn/Soul

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zhangzjn/Soul)

zhangzjn / Soul

[CVPR 2026] Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation

☆64

Alternatives and similar repositories for Soul

Users that are interested in Soul are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zhangzjn / T3-Video
View on GitHub
[ICML 2026] Transform Trained Transformer for Accelerating Native 4K Video Generation
☆41Dec 16, 2025Updated 7 months ago
wencanjiang / SPIKE
View on GitHub
An adaptive dual controller framework for cost-efficient long-horizon game control.
☆18May 20, 2026Updated 2 months ago
HaojunChen663 / PixVerve-95K
View on GitHub
Official repository for the paper "PixVerve: Advancing Native UHR Image Generation to 100MP with a Large-Scale High-Quality Dataset"
☆31Jul 10, 2026Updated 2 weeks ago
juntaoJianggavin / M3CoTBench
View on GitHub
Official implementation of the paper "M3CoTBench: Benchmark Chain-of-Thought of MLLMs in Medical Image Understanding"
☆30Jan 14, 2026Updated 6 months ago
juntaoJianggavin / APRIL-MedSeg
View on GitHub
A Modern Modular 2D Medical Image Segmentation Toolbox
☆51Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Ryan-w2024 / PoseAnything
View on GitHub
☆39Jan 21, 2026Updated 6 months ago
rain152 / LFA-Video-Generation
View on GitHub
From Large Angles to Consistent Faces: Identity-Preserving Video Generation via Mixture of Facial Experts
☆28Jan 12, 2026Updated 6 months ago
sjtuplayer / Harmony
View on GitHub
Audio-video joint generation
☆58Nov 27, 2025Updated 8 months ago
lyrig / TokenAR
View on GitHub
TokenAR: Multiple Subject Generation via Autoregressive Token-level enhancement
☆22Mar 4, 2026Updated 4 months ago
latentcraft / replay
View on GitHub
[CVPR 2026] Boosting Reasoning in Large Multimodal Models via Activation Replay
☆24May 7, 2026Updated 2 months ago
xlyu0106 / MACT
View on GitHub
☆19Jul 31, 2025Updated 11 months ago
LsmnBmnc / Med-CMR
View on GitHub
Official code repository for Med-CMR : "A Fine-Grained Benchmark Integrating Visual Evidence and Clinical Logic for Medical Complex Multi…
☆26Dec 10, 2025Updated 7 months ago
YinBo0927 / FeRA
View on GitHub
[ICML 2026] The official code of FeRA: Frequency–Energy Constrained Routing for Effective Diffusion Adaptation Fine-Tuning
☆29Dec 27, 2025Updated 7 months ago
Yuan-Hou / Human-MME
View on GitHub
Official repository for "Human-MME: A Holistic Evaluation Benchmark for Human-Centric Multimodal Large Language Models"
☆22Dec 2, 2025Updated 7 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
HUuxiaobin / VTBench
View on GitHub
☆23May 26, 2025Updated last year
xlyu0106 / ViF
View on GitHub
[ICLR 26] Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow
☆44Oct 3, 2025Updated 9 months ago
sjtuplayer / UltraGen
View on GitHub
[AAAI 2026] UltraGen
☆77Feb 1, 2026Updated 5 months ago
MCG-NJU / Sora2-mini
View on GitHub
UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions
☆57Dec 16, 2025Updated 7 months ago
fudan-generative-vision / hallo4
View on GitHub
[SIGGRAPH Asia 2025] Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization
☆38Nov 30, 2025Updated 7 months ago
sjtuplayer / IAR
View on GitHub
[CVPR25] IAR
☆18Jun 13, 2025Updated last year
congwei1230 / MoCha-Demo
View on GitHub
[NeurIPS 2025 Spotlight] Demo implementation of MoCha Towards Movie-Grade Talking Character Synthesis
☆17Dec 27, 2025Updated 7 months ago
xlyu0106 / VisMem
View on GitHub
☆91Feb 5, 2026Updated 5 months ago
zhangzjn / EMOv2
View on GitHub
[T-PAMI 2025] EMOv2: Pushing 5M Vision Model Frontier
☆54Dec 30, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
BigAandSmallq / SAD
View on GitHub
Official implementation of "Towards One-Step Causal Video Generation via Adversarial Self-Distillation" (arXiv 2025). A novel framework f…
☆31Nov 4, 2025Updated 8 months ago
OmniForcing / OmniForcing
View on GitHub
[ECCV 2026 Oral] Official implementation of "OmniForcing: Unleashing Real-time Joint Audio-Visual Generation"[arXiv:2603.11647]. OmniForc…
☆171Updated this week
KlingAIResearch / MemFlow
View on GitHub
Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"
☆216Dec 29, 2025Updated 7 months ago
xyz123xyz456 / hallo4
View on GitHub
☆61Dec 1, 2025Updated 7 months ago
wyhsirius / LIA-X
View on GitHub
LIA-X: Interpretable Latent Portrait Animator
☆105Sep 17, 2025Updated 10 months ago
NJU-PCALab / L2P
View on GitHub
L2P: Unlocking Latent Potential for Pixel Generation
☆39May 22, 2026Updated 2 months ago
sony / mmaudiosep
View on GitHub
☆16Apr 30, 2026Updated 2 months ago
hyj542682306 / Semantic-Frame-Interpolation
View on GitHub
☆21Jul 8, 2025Updated last year
AIGC-Explorer / TIMotion
View on GitHub
☆50Jan 15, 2026Updated 6 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
NUS-Project / MedMASLab
View on GitHub
☆30Mar 22, 2026Updated 4 months ago
Fictionarry / InsTaG
View on GitHub
[CVPR'25] InsTaG: Learning Personalized 3D Talking Head from Few-Second Video
☆175Jul 15, 2025Updated last year
OpenVE-Team / OpenVE-3M
View on GitHub
OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing
☆51Apr 15, 2026Updated 3 months ago
Fantasyele / LLaVA-KD
View on GitHub
[ICCV 2025] Official implementation of LLaVA-KD: A Framework of Distilling Multimodal Large Language Models
☆134Oct 14, 2025Updated 9 months ago
Briley-byl123 / MPER
View on GitHub
2025ICASSP
☆17Jun 23, 2025Updated last year
TencentYoutuResearch / T2I-L2P
View on GitHub
Code for "L2P: Unlocking Latent Potential for Pixel Generation"
☆179Jul 11, 2026Updated 2 weeks ago
showlab / Multi-human-Talking-Video-Dataset
View on GitHub
Muti-human Interactive Talking Dataset
☆75Aug 6, 2025Updated 11 months ago