lichao-sun / Mora
Mora: More like Sora for Generalist Video Generation
☆1,517 Updated last month
Related projects
Alternatives and complementary repositories for Mora
- Lumina-T2X is a unified framework for Text to Any Modality Generation ☆2,086 Updated 3 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG) ☆1,693 Updated last month
- Latte: Latent Diffusion Transformer for Video Generation. ☆1,710 Updated last month
- DeepSeek-VL: Towards Real-World Vision-Language Understanding ☆2,077 Updated 6 months ago
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models ☆2,971 Updated 3 weeks ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors ☆2,589 Updated 2 months ago
- StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text ☆1,427 Updated this week
- VideoSys: An easy and efficient system for video generation ☆1,775 Updated this week
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models ☆884 Updated last week
- MiniSora: A community that aims to explore the implementation path and future development direction of Sora. ☆1,218 Updated last month
- Character Animation (AnimateAnyone, Face Reenactment) ☆3,185 Updated 5 months ago
- InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥 ☆1,676 Updated 2 months ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis ☆2,809 Updated 3 weeks ago
- The best OSS video generation models ☆2,050 Updated this week
- Next-Token Prediction is All You Need ☆1,824 Updated 3 weeks ago
- [ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction ☆916 Updated last week
- ☆2,898 Updated last month
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion ☆1,490 Updated this week
- V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images. ☆2,255 Updated last week
- MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators ☆1,300 Updated 3 months ago
- Janus-Series: Unified Multimodal Understanding and Generation Models ☆1,084 Updated last week
- Code of Pyramidal Flow Matching for Efficient Video Generative Modeling ☆2,340 Updated this week
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA ☆1,409 Updated last month
- A general fine-tuning kit geared toward diffusion models. ☆1,811 Updated this week
- GPT4V-level open-source multi-modal model based on Llama3-8B ☆2,122 Updated 2 months ago
- ☆731 Updated 9 months ago
- Mixture-of-Experts for Large Vision-Language Models ☆1,989 Updated 6 months ago
- ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment ☆1,091 Updated 4 months ago
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR. ☆1,840 Updated 3 months ago
- Fine-Grained Open Domain Image Animation with Motion Guidance ☆788 Updated last month