fudan-generative-vision / hallo2
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
☆3,419Updated last month
Alternatives and similar repositories for hallo2:
Users that are interested in hallo2 are comparing it to the libraries listed below
- Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".☆954Updated 5 months ago
- Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance☆4,117Updated 6 months ago
- The official repository for paper "Tora: Trajectory-oriented Diffusion Transformer for Video Generation"☆932Updated last week
- ☆746Updated last month
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation☆8,119Updated 4 months ago
- [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation☆927Updated 2 months ago
- MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators☆1,244Updated last week
- [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"☆1,439Updated last month
- Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language m…☆4,247Updated last week
- 3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion☆858Updated 3 months ago
- Official implementation of the paper "TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion I…☆840Updated 2 months ago
- Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models☆602Updated 3 months ago
- Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥☆488Updated last week
- Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Diffusion Transformer Networks☆517Updated this week
- Unofficial Implementation of Animate Anyone☆2,888Updated 6 months ago
- PantoMatrix: Generating Face and Body Animation from Speech☆918Updated this week
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te…☆965Updated 2 weeks ago
- The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"☆796Updated this week
- [NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions