[CVPR 2026] Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation
☆55Dec 16, 2025Updated 2 months ago
Alternatives and similar repositories for Soul
Users who are interested in Soul are comparing it to the repositories listed below
- [CVPR 2025] 3D Gaussian Head Avatars with Expressive Dynamic Appearances by Compact Tensorial Representations☆36Sep 3, 2025Updated 6 months ago
- Official implementation of the paper "M3CoTBench: Benchmark Chain-of-Thought of MLLMs in Medical Image Understanding"☆21Jan 14, 2026Updated last month
- Official implementation of the paper "HRAvatar: High-Quality and Relightable Gaussian Head Avatar" [CVPR 2025]☆101Jul 29, 2025Updated 7 months ago
- [ICCV 2025] Official implementation of LLaVA-KD: A Framework of Distilling Multimodal Large Language Models☆125Oct 14, 2025Updated 4 months ago
- [ICME 2025] DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation☆24Mar 25, 2025Updated 11 months ago
- The official code for paper: GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expressions…☆22Dec 12, 2024Updated last year
- [T-PAMI 2025] EMOv2: Pushing 5M Vision Model Frontier☆54Dec 30, 2024Updated last year
- [CVPR2025] KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation☆69Apr 8, 2025Updated 11 months ago
- [T-PAMI2025] Gen-3Diffusion: Realistic Image-to-3D Generation via 2D & 3D Diffusion Synergy☆28Jan 13, 2025Updated last year
- DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation☆39Aug 3, 2025Updated 7 months ago
- Wan 2.5 AI Video Generator - Transform text & images into HD videos with synchronized audio☆79Sep 25, 2025Updated 5 months ago
- Unofficial LivePortrait Training Script [🚧 Under Construction]☆38Jan 28, 2025Updated last year
- [AAAI2025] GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians☆37Apr 2, 2025Updated 11 months ago
- Code for CVPR 2024 paper: ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis☆35Apr 29, 2025Updated 10 months ago
- Leveraging A-priori Knowledge in Predictive Business Process Monitoring☆10Jul 16, 2018Updated 7 years ago
- CoMA: Compositional Human Motion Generation with Multi-modal Agents☆14Jul 31, 2025Updated 7 months ago
- Revolutionizing MMD animation with real-time motion capture powered by MediaPipe! Create professional animations in minutes, not hours. P…☆21Feb 23, 2026Updated 2 weeks ago
- Code for the paper "IFFNeRF: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model"☆12May 26, 2024Updated last year
- [CVPR 2025 Highlight] Real-time High-fidelity Gaussian Human Avatars with Position-based Interpolation of Spatially Distributed MLPs☆69Jun 18, 2025Updated 8 months ago
- FFHQ-UV-Intrinsics: A Dataset Containing Intrinsic Face Decomposition☆46Mar 25, 2024Updated last year
- A vision-language model with an improved cross-attention mechanism for scalable streaming inference☆27Dec 24, 2025Updated 2 months ago
- Agentic Keyframe Search for Video Question Answering☆16Apr 7, 2025Updated 11 months ago
- Official implementation for the paper: "Real-Time Inverse Kinematics for Generating Multi-Constrained Movements of Virtual Human Characte…☆18Feb 19, 2026Updated 2 weeks ago
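- LTX-Video-Trainer-GUI is a GUI tool for training LTX video LoRA models. It supports training LoRA models for video generation through a simple interface, letting users configure and launch training runs without writing complex code.☆13Jul 18, 2025Updated 7 months ago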
- Interactive visualization of the output of any binary classifier.☆14Oct 15, 2020Updated 5 years ago
- A smart upload tool for using Tencent Cloud COS as an image host☆10Jan 14, 2019Updated 7 years ago
- A simple Python wrapper for gpupixel using SourceRawDataInput and TargetRawDataOutput.☆11Aug 14, 2024Updated last year
- [🔥ACM MM2025] EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation☆23Dec 30, 2025Updated 2 months ago
- Webpage of "Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer"☆11Jul 2, 2024Updated last year
- [ICLR 2024] Neural Processing of Tri-Plane Hybrid Neural Fields☆14Feb 21, 2026Updated 2 weeks ago
- This is a LoRA model finetuned on Wan-I2V-14B-480P. It turns things in the image into fluffy toys.☆19Nov 10, 2025Updated 3 months ago
- AnyTalker: Scaling Multi-person Talking Video Generation with Interactivity Refinement☆279Dec 5, 2025Updated 3 months ago
- [ICLR 2025] X-NeMo & Project X-Portrait2☆117Aug 7, 2025Updated 7 months ago