FoundationVision / Infinity
Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
☆871 Updated this week
Alternatives and similar repositories for Infinity:
Users who are interested in Infinity are comparing it to the repositories listed below.
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation ☆695 Updated this week
- SEED-Voken: A Series of Powerful Visual Tokenizers ☆810 Updated 2 weeks ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation ☆1,460 Updated 5 months ago
- Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation. ☆1,129 Updated 3 weeks ago
- This repo contains the code for 1D tokenizer and generator ☆645 Updated this week
- Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think ☆791 Updated last month
- Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers" ☆742 Updated 10 months ago
- Implementation of MagViT2 Tokenizer in Pytorch ☆588 Updated this week
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini… ☆534 Updated 5 months ago
- A collection of awesome video generation studies. ☆425 Updated this week
- Stable Video Diffusion Training Code and Extensions. ☆654 Updated 5 months ago
- (NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis ☆621 Updated 3 months ago
- PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838 ☆1,234 Updated 3 months ago
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers ☆557 Updated 2 months ago
- A reading list of video generation ☆473 Updated this week
- Diffusion Model-Based Image Editing: A Survey (arXiv) ☆540 Updated 3 weeks ago
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer ☆405 Updated 3 months ago
- [CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation" ☆535 Updated last week
- VideoSys: An easy and efficient system for video generation ☆1,875 Updated 2 weeks ago
- PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024) ☆476 Updated 7 months ago
- [ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation ☆480 Updated 2 months ago
- Next-Token Prediction is All You Need ☆1,965 Updated 2 months ago
- [ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model ☆393 Updated 2 months ago
- [ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP" ☆732 Updated 5 months ago
- ☆354 Updated 2 months ago
- Memory-optimized training scripts for video models based on Diffusers ☆730 Updated this week
- Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI ☆900 Updated this week
- Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions" ☆395 Updated 4 months ago
- NOVA: Autoregressive Video Generation without Vector Quantization ☆314 Updated this week
- A collection of resources on controllable generation with text-to-image diffusion models. ☆969 Updated 2 weeks ago