fudan-generative-vision / hallo2
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
☆3,513Updated 3 weeks ago
Alternatives and similar repositories for hallo2:
Users that are interested in hallo2 are comparing it to the libraries listed below
- Code for SCIS-2025 Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".☆1,061Updated 8 months ago
- Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer☆1,149Updated last week
- ☆822Updated 3 months ago
- Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance☆4,183Updated 8 months ago
- Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models☆908Updated last week
- [ICLR 2025 Oral] TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation☆970Updated 2 weeks ago
- [CVPR'25]Tora: Trajectory-oriented Diffusion Transformer for Video Generation☆1,105Updated 3 weeks ago
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te…☆1,064Updated last month
- 🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity☆767Updated this week
- Video generation from text&image, 1st-gen☆828Updated last month
- MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code☆650Updated 5 months ago
- Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥☆555Updated 2 months ago
- Customized ID Consistent for human☆946Updated last month
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation☆8,329Updated 6 months ago
- MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators☆1,296Updated last month
- PantoMatrix: Generating Face and Body Animation from Speech☆994Updated 2 months ago
- [CVPR 2025🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition☆647Updated 3 weeks ago
- [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"☆1,530Updated 3 months ago
- [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation☆1,053Updated 4 months ago
- [NeurIPS 2024] DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation☆1,128Updated this week
- Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。☆1,688Updated 2 months ago
- Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code☆1,016Updated 5 months ago
- Memory-Guided Diffusion for Expressive Talking Video Generation☆763Updated 2 months ago
- [NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions☆1,049Updated 5 months ago
- Resources of our paper "FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces". New versions in the maki…☆930Updated last week
- Code of LHM: Large Animatable Human Reconstruction Model for Single Image to 3D in Seconds☆862Updated this week
- "VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos"☆490Updated 3 weeks ago
- [CVPR 2025] 3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion☆931Updated last week
- Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers"☆485Updated 3 weeks ago
- Unofficial Implementation of Animate Anyone☆2,918Updated 8 months ago