Alpha-VLLM / Lumina-mGPT
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"
☆543Updated 6 months ago
Alternatives and similar repositories for Lumina-mGPT:
Users that are interested in Lumina-mGPT are comparing it to the libraries listed below
- Multimodal Models in Real World☆437Updated 3 months ago
- Code repository for T2V-Turbo and T2V-Turbo-v2☆285Updated 2 weeks ago
- ☆424Updated 2 months ago
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆418Updated 4 months ago
- Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"☆405Updated 5 months ago
- Official implementation of OneDiffusion paper☆596Updated 2 months ago
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization☆385Updated this week
- Memory-optimized training scripts for video models based on Diffusers☆860Updated this week
- ☆355Updated 4 months ago
- VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE☆281Updated last month
- Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis☆957Updated this week
- Official repository for the paper PLLaVA☆637Updated 6 months ago
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)☆483Updated 8 months ago
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences☆273Updated 6 months ago
- Let's finetune video generation models!☆394Updated this week
- Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation☆504Updated 5 months ago
- [ICLR 2025] Official Implementation of Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image…☆285Updated 2 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆232Updated 6 months ago
- Stable Video Diffusion Training Code and Extensions.☆668Updated 6 months ago
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers☆572Updated 3 months ago
- [NeurIPS 2024] Boosting the performance of consistency models with PCM!☆435Updated 2 months ago
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆381Updated 5 months ago
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆195Updated last week
- [ICLR 2025] Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation☆449Updated last month
- [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation☆243Updated last week
- ☆209Updated 6 months ago
- SEED-Voken: A Series of Powerful Visual Tokenizers☆830Updated this week
- [ICLR 2024] Code for FreeNoise based on VideoCrafter☆398Updated 7 months ago
- Enhance-A-Video: Better Generated Video for Free☆390Updated this week
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆262Updated 2 months ago