[NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios
☆16Oct 18, 2024Updated last year
Alternatives and similar repositories for CSKV
Users that are interested in CSKV are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A plug-and-play tool for visualizing attention-score heatmap in generative LLMs. Easy to customize for your own need.☆52May 16, 2024Updated 2 years ago
- An image of ROS with Nvidia cudagl to enable GUI☆14Apr 18, 2020Updated 6 years ago
- This Repository has a project which explains how to generate a face point cloud in Three js using Tensorflow Js☆23Jun 9, 2022Updated 3 years ago
- ☆18Apr 28, 2022Updated 4 years ago
- Evaluation utilities based on SymPy.☆22Dec 12, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 基于PYNQ Z2开发板与Vivado 2022.2的FPGA开发板使用教程☆34Mar 24, 2023Updated 3 years ago
- Code Repository of Evaluating Quantized Large Language Models☆135Sep 8, 2024Updated last year
- [ICML24] Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for LLMs☆100Nov 25, 2024Updated last year
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 10 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- 清华大学第六届人工智能挑战赛电子系赛道(原电子系第 24 届队式程序设计大赛 teamstyle24)☆28May 11, 2024Updated 2 years ago
- A Framework for Evaluating AI Agent Safety in Realistic Environments☆35Oct 2, 2025Updated 7 months ago
- Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference☆250Feb 3, 2026Updated 3 months ago
- ☆10Mar 13, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆12Jun 13, 2025Updated 11 months ago
- ☆13Jul 10, 2024Updated last year
- Activation-aware Singular Value Decomposition for Compressing Large Language Models☆92Oct 22, 2024Updated last year
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 10 months ago
- 基于Qwen-2.5-1.5B 进行DPO fine-tuning后,意外说真话的AI暴躁哥☆72Jan 18, 2025Updated last year
- ☆16Sep 4, 2025Updated 8 months ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- ☆18Mar 2, 2026Updated 2 months ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 6 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆13Mar 5, 2025Updated last year
- ☆25Aug 9, 2025Updated 9 months ago
- CMU Linguistic Annotation Backend☆15Sep 22, 2025Updated 7 months ago
- ☆20Jun 10, 2025Updated 11 months ago
- Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach☆13Apr 2, 2025Updated last year
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Apr 12, 2026Updated last month
- ☆14Apr 25, 2025Updated last year
- ☆13Apr 13, 2026Updated last month
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆17Apr 2, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆13Sep 12, 2024Updated last year
- ☆14Jan 22, 2025Updated last year
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆20Nov 4, 2025Updated 6 months ago
- The official implement of paper 《DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents》☆31Oct 23, 2025Updated 6 months ago
- ☆11Jun 22, 2025Updated 10 months ago
- ☆16Apr 30, 2024Updated 2 years ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated 3 months ago