Alpha-VLLM / Lumina-mGPT
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"
☆500Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for Lumina-mGPT
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆340Updated last month
- Memory optimized finetuning scripts for CogVideoX using TorchAO and DeepSpeed☆406Updated this week
- Multimodal Models in Real World☆403Updated 3 weeks ago
- Code repository for T2V-Turbo and T2V-Turbo-v2☆250Updated 3 weeks ago
- ☆349Updated last month
- I'm back! Implementations of Meissonic developed by Community~If you feel it is helpful, plz consider giving a star❤️☆249Updated last week
- Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"☆371Updated 2 months ago
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)☆445Updated 5 months ago
- ☆254Updated 3 months ago
- Official repository for the paper PLLaVA☆593Updated 3 months ago
- Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation☆460Updated 2 months ago
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation☆580Updated 2 weeks ago
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆361Updated 2 months ago
- [ICLR 2024] Code for FreeNoise based on VideoCrafter☆386Updated 4 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆212Updated 3 months ago
- Stable Video Diffusion Training Code and Extensions.☆607Updated 3 months ago
- LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi…☆435Updated 2 months ago
- Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen☆331Updated last month
- [NeurIPS 2024] Boosting the performance of consistency models with PCM!☆364Updated 3 weeks ago
- Open-MAGVIT2: Democratizing Autoregressive Visual Generation☆705Updated last month
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆243Updated 2 weeks ago
- [ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.☆492Updated 8 months ago
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers☆526Updated 3 weeks ago
- Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation☆402Updated last month
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"☆240Updated last month
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model☆389Updated last week
- ☆193Updated 4 months ago
- Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model☆390Updated 5 months ago
- (NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis☆509Updated last month
- Official implementation of the ECCV paper "SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing"☆232Updated last month