NUS-HPC-AI-Lab / VideoSys
VideoSys: An easy and efficient system for video generation
☆1,761Updated this week
Related projects ⓘ
Alternatives and complementary repositories for VideoSys
- Latte: Latent Diffusion Transformer for Video Generation.☆1,698Updated last month
- MiniSora: A community aims to explore the implementation path and future development direction of Sora.☆1,214Updated last month
- Next-Token Prediction is All You Need☆1,793Updated 2 weeks ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,302Updated 2 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,070Updated 3 months ago
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA☆1,396Updated last month
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆2,790Updated last week
- [CSUR] A Survey on Video Diffusion Models☆1,796Updated this week
- Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"☆667Updated 7 months ago
- [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models☆587Updated this week
- [CVPR 2024] DeepCache: Accelerating Diffusion Models for Free☆791Updated 4 months ago
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆1,283Updated this week
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation☆1,668Updated last week
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation☆561Updated this week
- Open-MAGVIT2: Democratizing Autoregressive Visual Generation☆686Updated last month
- StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text☆1,413Updated 2 months ago
- [NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scala…☆4,223Updated last month
- Implementation of MagViT2 Tokenizer in Pytorch☆559Updated 3 weeks ago
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models☆875Updated 2 months ago
- The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision M…☆492Updated 7 months ago
- Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.☆1,011Updated this week
- A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".☆916Updated last year
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini…☆495Updated 2 months ago
- The best OSS video generation models☆1,804Updated this week
- InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)☆1,189Updated 5 months ago
- ☆601Updated this week
- Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI☆688Updated this week
- This repo contains the code for 1D tokenizer and generator☆527Updated this week
- Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation☆913Updated last week
- Emu Series: Generative Multimodal Models from BAAI☆1,659Updated last month