[ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation
☆104May 8, 2026Updated last month
Alternatives and similar repositories for lpd
Users that are interested in lpd are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fast, memory-efficient attention column reduction (e.g., sum, mean, max)☆47Feb 10, 2026Updated 4 months ago
- [ICML 2025] SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity☆76Mar 10, 2026Updated 3 months ago
- [ECCV 2024] Isomorphic Pruning for Vision Models☆85Jul 23, 2024Updated last year
- [NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems☆17Nov 1, 2025Updated 7 months ago
- ☆145Jun 28, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [NeurIPS 2025] ScaleKV: Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression☆52Mar 13, 2026Updated 3 months ago
- [ASPLOS'26] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter☆173Feb 27, 2026Updated 3 months ago
- [NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆217Sep 27, 2025Updated 8 months ago
- ☆41Jul 29, 2025Updated 10 months ago
- A record of reading list on some MLsys popular topic☆25Mar 20, 2025Updated last year
- A WebUI for Side-by-Side Comparison of Media (Images/Videos) Across Multiple Folders☆27Feb 21, 2025Updated last year
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆108Sep 27, 2025Updated 8 months ago
- Denoising Diffusion Step-aware Models (ICLR2024)☆62Feb 6, 2024Updated 2 years ago
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆648Oct 16, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"☆315Dec 23, 2024Updated last year
- [ICML 2025] Official PyTorch implementation of paper "Ultra-Resolution Adaptation with Ease".☆117May 3, 2025Updated last year
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.☆34Mar 11, 2025Updated last year
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆97Mar 1, 2025Updated last year
- Patch convolution to avoid large GPU memory usage of Conv2D☆97Jan 23, 2025Updated last year
- using InstantX's CSGO in comfyUI☆17Sep 7, 2024Updated last year
- ☆14Jul 17, 2024Updated last year
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text☆53Mar 16, 2025Updated last year
- 🔥🔥🔥 Support TeaCache acceleration for 2x faster inference with minimal quality loss☆50May 6, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ICLR'25] ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation☆161Mar 21, 2025Updated last year
- [ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads☆541Feb 10, 2025Updated last year
- Easy, Fast, and Scalable Multimodal AI☆126Jun 2, 2026Updated last week
- VidKV: Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models☆26Mar 26, 2025Updated last year
- ☆43May 30, 2025Updated last year
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆86Jul 16, 2024Updated last year
- The official implementation of Recurrent Diffusion for Large-Scale Parameter Generation.☆80Sep 24, 2025Updated 8 months ago
- [ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation☆423Apr 25, 2025Updated last year
- ☆30May 7, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"☆21Dec 10, 2024Updated last year
- [ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality…☆51Mar 25, 2025Updated last year
- This is the official repository for paper: cross-modal information flow in multimodal large language models☆44May 21, 2025Updated last year
- Model Compression Toolbox for Large Language Models and Diffusion Models☆787Aug 14, 2025Updated 10 months ago
- ☆23Aug 17, 2024Updated last year
- A parallelism VAE avoids OOM for high resolution image generation☆94May 8, 2026Updated last month
- [CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis☆1,570Apr 16, 2026Updated last month