fudan-generative-vision / hallo2Links
[ICLR 2025] Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
☆3,603Updated 7 months ago
Alternatives and similar repositories for hallo2
Users that are interested in hallo2 are comparing it to the libraries listed below
Sorting:
- Code for SCIS-2025 Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".☆1,169Updated 5 months ago
- [ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance☆4,233Updated last year
- [CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer☆1,310Updated 6 months ago
- ☆899Updated 9 months ago
- [ICLR 2025 Oral] TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation☆1,109Updated last month
- Real Time High-Fidelity Faceswap☆848Updated 4 months ago
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation☆8,584Updated last year
- Memory-Guided Diffusion for Expressive Talking Video Generation☆1,063Updated 2 months ago
- [TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators☆1,337Updated 2 months ago
- 🔥 [ICCV 2025 Highlight] InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity☆2,619Updated last month
- [CVPR'25]Tora: Trajectory-oriented Diffusion Transformer for Video Generation☆1,202Updated 2 months ago
- Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models☆915Updated 6 months ago
- [ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds☆2,419Updated 2 months ago
- MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code☆779Updated 11 months ago
- Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。☆1,817Updated 8 months ago
- [NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Tho…☆1,046Updated 2 weeks ago
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te…☆1,098Updated 8 months ago
- ☆1,675Updated 2 months ago
- Unofficial Implementation of Animate Anyone☆2,937Updated last year
- PantoMatrix: Generating Face and Body Animation from Speech☆1,116Updated 8 months ago
- [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation☆1,133Updated 3 weeks ago
- Customized ID Consistent for human☆972Updated 7 months ago
- Resources of our paper "FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces". New versions in the maki…☆1,031Updated 6 months ago
- The official implementation of RealisDance☆599Updated 3 months ago
- Video generation from text&image, 1st-gen☆921Updated 4 months ago
- [NeurIPS 2024] An official implementation of "ShareGPT4Video: Improving Video Understanding and Generation with Better Captions"☆1,077Updated 11 months ago
- Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code☆1,064Updated 11 months ago
- [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"☆1,666Updated 9 months ago
- [CVPR2025] We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos…☆1,378Updated 2 weeks ago
- [ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥☆605Updated 3 months ago