lixinyyang / MoDA
MoDA: Multi-modal Diffusion Architecture for Talking Head Generation
☆77 Updated this week
Alternatives and similar repositories for MoDA
Users who are interested in MoDA are comparing it to the repositories listed below
- [SIGGRAPH'25] SOAP: Style-Omniscient Animatable Portraits ☆428 Updated 2 weeks ago
- [CVPR2025] AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction ☆440 Updated 5 months ago
- ☆410 Updated 5 months ago
- A Native Multimodal LLM for 3D Generation and Understanding ☆470 Updated last month
- Video generation from text&image, 1st-gen ☆922 Updated 3 months ago
- [CVPR 2025] A Large-Scale High-Quality Dataset for Enhancing Human-Centric Video Generation ☆267 Updated 5 months ago
- Official implementation of "JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization" ☆201 Updated 2 weeks ago
- Pytorch Implementation of FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing (ICLR 2024) ☆205 Updated last year
- The repository for 'Tri$^{2}$-plane: Volumetric Avatar Reconstruction with Feature Pyramid' ☆141 Updated 3 months ago
- ☆133 Updated 2 months ago
- Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025) ☆715 Updated 3 months ago
- Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, C… ☆302 Updated 11 months ago
- Efficient DiT architecture for text2any tasks, ICLR2025 ☆451 Updated 3 months ago
- Code for paper "Towards Understanding Camera Motions in Any Video" ☆207 Updated last week
- CVPR 2025 Highlight ☆34 Updated 2 months ago
- [ICML 2023 Oral, NeurIPS 2023] Official implementations for paper: Customizable Image Synthesis with Multiple Subjects ☆442 Updated last year
- ☆167 Updated last year
- GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation ☆497 Updated last year
- Unified Multimodal Model for image generation/editing/understanding ☆750 Updated last week
- Free-T2M: Frequency enhanced text-to-motion diffusion model with consistency loss ☆66 Updated 6 months ago
- 🔥 [ICCV 2025 Highlight] Official ComfyUI native node supporting InfiniteYou with FLUX ☆258 Updated last month
- The official implementation of "MeGA: Hybrid Mesh-Gaussian Head Avatar for High-Fidelity Rendering and Head Editing". ☆194 Updated 3 months ago
- Official implementation for "Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model" (NeurIPS 2024) ☆254 Updated 4 months ago
- MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE ☆305 Updated last week
- The official implementation of RealisDance ☆593 Updated 2 months ago
- Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation". ☆585 Updated last month
- ☆74 Updated 5 months ago
- ☆41 Updated last year
- [CVPR 2025] Efficient Video Face Enhancement with Enhanced Spatial-Temporal Consistency ☆49 Updated 3 weeks ago
- [ICCV 2025] The official implementation of MotionLab ☆144 Updated 2 months ago