DAMO-NLP-SG / DiGIT
[NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
☆57 Updated 2 months ago
Alternatives and similar repositories for DiGIT:
Users who are interested in DiGIT are comparing it to the repositories listed below
- ☆128 Updated last month
- ☆43 Updated 2 weeks ago
- Codebase for the paper "Elucidating the Design Space of Language Models for Image Generation" ☆45 Updated 2 months ago
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching”. FlowAR employs the simplest scale design and is compatible with an… ☆72 Updated 3 weeks ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner" ☆62 Updated 3 months ago
- VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation ☆95 Updated last week
- Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding ☆26 Updated 2 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation ☆40 Updated last month
- This is the official implementation for ControlVAR. ☆88 Updated last month
- CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient ☆75 Updated last month
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions" ☆45 Updated 3 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models. ☆44 Updated 3 months ago
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment" ☆23 Updated 2 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NeurIPS 2024] ☆75 Updated 2 months ago
- ☆26 Updated 5 months ago
- Official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP 2024] ☆67 Updated last month
- Liquid: Language Models are Scalable Multi-modal Generators ☆60 Updated last month
- [NeurIPS 2024] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models ☆36 Updated 3 months ago
- This is a repo to track the latest autoregressive visual generation papers. ☆103 Updated 2 weeks ago
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign) ☆59 Updated 3 months ago
- A collection of awesome autoregressive visual generation models ☆63 Updated 2 weeks ago
- The official implementation of OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows ☆47 Updated last week
- [NeurIPS 2024] The official implementation of the research paper "FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Atten… ☆34 Updated last month
- [NeurIPS 2024] Efficient Multi-modal Models via Stage-wise Visual Context Compression ☆50 Updated 5 months ago
- ☆112 Updated 6 months ago
- ☆43 Updated 4 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆83 Updated 6 months ago
- XQ-GAN🚀: An Open-source Image Tokenization Framework for Autoregressive Generation ☆178 Updated last month
- The official implementation of PAR: Parallelized Autoregressive Visual Generation. https://epiphqny.github.io/PAR-project/ ☆106 Updated 2 weeks ago
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models ☆59 Updated 7 months ago