LINs-lab / ERW
[Preprint] Efficient Generative Model Training via Embedded Representation Warmup
☆17Updated last week
Alternatives and similar repositories for ERW:
Users that are interested in ERW are comparing it to the libraries listed below
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆31Updated 2 months ago
- This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality"☆46Updated last month
- [ICLR2025] IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis☆33Updated 2 months ago
- Autoregressive Image Generation with Randomized Parallel Decoding☆50Updated 3 weeks ago
- Code for paper "Principal Components" Enable A New Language of Images☆37Updated last week
- [ICLR 2025] Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding☆35Updated last week
- ImageGen-CoT: Enhancing Text-to-Image In-context Learning with Chain-of-Thought Reasoning☆28Updated 3 weeks ago
- ☆41Updated 5 months ago
- Official Repository of Personalized Visual Instruct Tuning☆28Updated last month
- ☆22Updated 3 weeks ago
- ☆75Updated 3 weeks ago
- ☆28Updated 4 months ago
- HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆57Updated 2 months ago
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆33Updated last month
- Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models"☆19Updated last year
- [ECCV 2024] Official pytorch implementation of "Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts"☆40Updated 9 months ago
- ☆27Updated last week
- Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing☆28Updated 4 months ago
- (NeurIPS 2024) BiDM: Pushing the Limit of Quantization for Diffusion Models☆18Updated 5 months ago
- ☆40Updated 9 months ago
- ☆22Updated 10 months ago
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆30Updated 2 months ago
- Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).☆67Updated this week
- Official implementation of MC-LLaVA.☆25Updated 2 months ago
- VidKV: Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models☆19Updated last month
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning"☆74Updated this week
- [ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models☆69Updated 7 months ago
- ☆33Updated 2 months ago
- The official implementation of our paper "CoRe^2: Collect, Reflect and Refine to Generate Better and Faster".☆22Updated last month
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆75Updated 4 months ago