[ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation
☆87Feb 7, 2026Updated 3 weeks ago
Alternatives and similar repositories for lpd
Users that are interested in lpd are comparing it to the libraries listed below
Sorting:
- A sparse attention kernel supporting mix sparse patterns☆467Jan 18, 2026Updated last month
- [ICML 2025] SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity☆71Jul 5, 2025Updated 7 months ago
- ☆141Jun 28, 2024Updated last year
- [ECCV 2024] Isomorphic Pruning for Vision Models☆81Jul 23, 2024Updated last year
- [NeurIPS 2025] ScaleKV: Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression☆50Nov 4, 2025Updated 4 months ago
- [ASPLOS'26] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter☆138Dec 5, 2025Updated 2 months ago
- A record of reading list on some MLsys popular topic☆22Mar 20, 2025Updated 11 months ago
- Denoising Diffusion Step-aware Models (ICLR2024)☆62Feb 6, 2024Updated 2 years ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆41Feb 12, 2025Updated last year
- [NeurIPS 2025]《SD-VLM: Spatial Measuring and Understanding with Depth-encoded Vision Language Models》☆37Dec 29, 2025Updated 2 months ago
- ☆43May 30, 2025Updated 9 months ago
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆108Sep 27, 2025Updated 5 months ago
- The official implementation of "EDA-DM: Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models"☆21Jul 8, 2025Updated 7 months ago
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text☆53Mar 16, 2025Updated 11 months ago
- 🔥🔥🔥 Support TeaCache acceleration for 2x faster inference with minimal quality loss☆50May 6, 2025Updated 9 months ago
- ☆22Aug 17, 2024Updated last year
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆95Mar 1, 2025Updated last year
- using InstantX's CSGO in comfyUI☆17Sep 7, 2024Updated last year
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆650Oct 16, 2024Updated last year
- ☆81Oct 18, 2025Updated 4 months ago
- VidKV: Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models☆26Mar 26, 2025Updated 11 months ago
- The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"☆20Dec 10, 2024Updated last year
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆86Jul 16, 2024Updated last year
- [ICLR'25] ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation☆151Mar 21, 2025Updated 11 months ago
- [ICML 2025] Official Implementation of Hierarchical Masked Autoregressive Models with Low-Resolution Token Pivots☆30May 28, 2025Updated 9 months ago
- [ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality…☆53Mar 25, 2025Updated 11 months ago
- Accelerating Large-Scale Reasoning Model Inference with Sparse Self-Speculative Decoding☆93Dec 2, 2025Updated 3 months ago
- Phi-3.5-vision-instruct fast talk with image☆17Aug 22, 2024Updated last year
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆98Feb 11, 2025Updated last year
- A WebUI for Side-by-Side Comparison of Media (Images/Videos) Across Multiple Folders☆25Feb 21, 2025Updated last year
- ☆29May 7, 2025Updated 9 months ago
- A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention☆283Dec 1, 2025Updated 3 months ago
- ☆149Feb 25, 2026Updated last week
- ☆34May 7, 2025Updated 9 months ago
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"☆314Dec 23, 2024Updated last year
- The official implementation of Recurrent Diffusion for Large-Scale Parameter Generation.☆78Sep 24, 2025Updated 5 months ago
- [ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads☆524Feb 10, 2025Updated last year
- [CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis☆1,547Nov 10, 2025Updated 3 months ago
- [ICLR 2025] COAT: Compressing Optimizer States and Activation for Memory-Efficient FP8 Training☆260Aug 9, 2025Updated 6 months ago