LINs-lab / ERW
[Preprint] Efficient Generative Model Training via Embedded Representation Warmup
☆18Updated last month
Alternatives and similar repositories for ERW
Users that are interested in ERW are comparing it to the libraries listed below
Sorting:
- Autoregressive Image Generation with Randomized Parallel Decoding☆59Updated last month
- HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆57Updated 3 months ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆31Updated 3 months ago
- This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality"☆47Updated last month
- Code for paper "Principal Components" Enable A New Language of Images☆40Updated last month
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆34Updated 2 months ago
- The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"☆48Updated last month
- ImageGen-CoT: Enhancing Text-to-Image In-context Learning with Chain-of-Thought Reasoning☆32Updated last month
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆31Updated 3 months ago
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆68Updated 6 months ago
- [ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models☆71Updated 8 months ago
- No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves☆31Updated this week
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆100Updated last month
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin…☆29Updated last week
- VidKV: Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models☆19Updated last month
- [CVPR 2025] T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆79Updated 3 weeks ago
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis☆53Updated 3 weeks ago
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆73Updated 2 months ago
- [ICLR2025] IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis☆34Updated 3 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 10 months ago
- ☆23Updated last month
- Official Repository of Personalized Visual Instruct Tuning☆28Updated 2 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆58Updated 2 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆63Updated last year
- ☆42Updated 3 weeks ago
- ☆80Updated last month
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆86Updated 7 months ago
- [ICLR 2025] Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding☆37Updated 3 weeks ago
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆81Updated 3 weeks ago
- An Efficient Text-to-Image Generation Pretrain Pipeline☆105Updated last month