[NeurIPS 2025] ScaleKV: Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression
☆50Mar 13, 2026Updated 3 weeks ago
Alternatives and similar repositories for ScaleKV
Users that are interested in ScaleKV are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2025 Highlight] TinyFusion: Diffusion Transformers Learned Shallow☆163Dec 1, 2025Updated 4 months ago
- [NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆214Sep 27, 2025Updated 6 months ago
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆64Sep 27, 2025Updated 6 months ago
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆117Jul 9, 2025Updated 9 months ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 6 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆109Nov 27, 2024Updated last year
- [ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation☆94Mar 12, 2026Updated last month
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆92Feb 14, 2025Updated last year
- [Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control☆45Nov 25, 2025Updated 4 months ago
- UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios☆125Updated this week
- This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.☆16Oct 20, 2025Updated 5 months ago
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆80Dec 10, 2024Updated last year
- Implementation for ACL 2024 paper "Meta-Task Prompting Elicits Embeddings from Large Language Models"☆12Jul 25, 2024Updated last year
- PISCO: Precise Video Instance Insertion with Sparse Control☆56Feb 13, 2026Updated last month
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The official implementation of "Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers" (arXiv …☆51Jun 6, 2025Updated 10 months ago
- [NeurIPS 24 Spotlight] MaskLLM: Learnable Semi-structured Sparsity for Large Language Models☆186Jan 1, 2025Updated last year
- [ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model☆17Feb 13, 2025Updated last year
- ☆32Oct 4, 2025Updated 6 months ago
- [ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image☆57Sep 19, 2024Updated last year
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆129May 22, 2025Updated 10 months ago
- [ICLR2025] γ -MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models☆43Oct 28, 2025Updated 5 months ago
- Vision Bridge Transformer at Scale☆141Dec 1, 2025Updated 4 months ago
- [NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance☆86Sep 18, 2025Updated 6 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 5 months ago
- Vocabulary Parallelism☆25Mar 10, 2025Updated last year
- [IEEE TII 2025] Official Implementation for "Dual-Detector Reoptimization for Federated Weakly Supervised Video Anomaly Detection via Ada…☆27Nov 11, 2025Updated 5 months ago
- [ICRA2025] RE-TRIP : Reflectivity Instance Augmented Triangle Descriptor for 3D Place Recognition☆31May 23, 2025Updated 10 months ago
- LibAFLGo: Evaluating and Advancing Directed Greybox Fuzzing☆25Mar 4, 2026Updated last month
- [ICLR 2026] Autoregressive Image Generation with Randomized Parallel Decoding☆89Feb 16, 2026Updated last month
- The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]☆21Feb 27, 2025Updated last year
- A Massive Multi-Discipline Lecture Understanding Benchmark☆34Nov 1, 2025Updated 5 months ago
- ☆239Nov 19, 2025Updated 4 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆11Feb 10, 2025Updated last year
- Cost-Sensitive Toolpath Agent for Multi-turn Image Editing☆29Mar 26, 2025Updated last year
- MarkBind is a tool for generating content-heavy websites from source files in Markdown format☆155Updated this week
- TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of u…☆25Jun 4, 2025Updated 10 months ago
- [ICLR 2026] Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing☆30Feb 6, 2026Updated 2 months ago
- ☆26Mar 5, 2026Updated last month
- [ACL 2026 Findings] CoV: Chain-of-View Prompting for Spatial Reasoning☆52Updated this week