fudan-generative-vision / hallo2
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
☆3,679Updated this week
Related projects ⓘ
Alternatives and complementary repositories for hallo2
- Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".☆1,014Updated 3 months ago
- Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance☆4,736Updated 4 months ago
- ☆750Updated 3 weeks ago
- MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators☆1,294Updated 3 months ago
- Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation☆1,103Updated last year
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation☆9,463Updated last month
- [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"☆1,425Updated 3 months ago
- 3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion☆981Updated 3 weeks ago
- Unofficial Implementation of Animate Anyone☆2,934Updated 4 months ago
- The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"☆768Updated 2 months ago
- Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language m…☆3,265Updated this week
- Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。☆1,515Updated this week
- Customized ID Consistent for human☆845Updated 3 months ago
- PantoMatrix: Co-Speech Talking Head and Gestures Generation☆979Updated 4 months ago
- cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,多租户,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注…☆2,141Updated this week
- ☆2,032Updated last month
- Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models☆639Updated last month
- The official repository for paper "Tora: Trajectory-oriented Diffusion Transformer for Video Generation"☆604Updated last week
- [NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions☆1,260Updated last month
- 【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models☆1,820Updated last week
- Official repo of our paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"☆605Updated 5 months ago
- A tutorial based on MetaGPT to quickly help you understand the concept of agent and muti-agent and get started with coding development. 基…☆1,357Updated 6 months ago
- SDG is a specialized framework designed to generate high-quality structured tabular data.☆3,274Updated this week
- SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling☆772Updated 2 weeks ago
- CSGHub is an open-source large model platform just like on-premise version of Hugging Face. You can easily manage models and datasets, de…☆2,971Updated this week
- csghub-server is the backend server for CSGHub which helps user to manage datasets, modes, and also run Model Inference, Finetune and App…☆517Updated this week
- [ ICLR 2024 ] Official Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"☆520Updated 6 months ago
- Accelerating the development of large multimodal models (LMMs) with lmms-eval☆1,936Updated this week
- Create textures for 3d models using stable-diffusion and blender☆833Updated last year
- The open source platform for AI-native application development.☆6,199Updated last week