[NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios
☆16Oct 18, 2024Updated last year
Alternatives and similar repositories for CSKV
Users that are interested in CSKV are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A plug-and-play tool for visualizing attention-score heatmap in generative LLMs. Easy to customize for your own need.☆51May 16, 2024Updated last year
- An image of ROS with Nvidia cudagl to enable GUI☆14Apr 18, 2020Updated 6 years ago
- This Repository has a project which explains how to generate a face point cloud in Three js using Tensorflow Js☆23Jun 9, 2022Updated 3 years ago
- ☆18Apr 28, 2022Updated 3 years ago
- Evaluation utilities based on SymPy.☆22Dec 12, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- 基于PYNQ Z2开发板与Vivado 2022.2的FPGA开发板使用教程☆32Mar 24, 2023Updated 3 years ago
- Code Repository of Evaluating Quantized Large Language Models☆134Sep 8, 2024Updated last year
- [ICML24] Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for LLMs☆98Nov 25, 2024Updated last year
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 9 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- 清华大学第六届人工智能挑战赛电子系赛道(原电子系第 24 届队式程序设计大赛 teamstyle24)☆28May 11, 2024Updated last year
- A Framework for Evaluating AI Agent Safety in Realistic Environments☆31Oct 2, 2025Updated 6 months ago
- Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference☆248Feb 3, 2026Updated 2 months ago
- ☆10Mar 13, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆13Jul 10, 2024Updated last year
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 9 months ago
- Activation-aware Singular Value Decomposition for Compressing Large Language Models☆91Oct 22, 2024Updated last year
- 基于Qwen-2.5-1.5B 进行DPO fine-tuning后,意外说真话的AI暴躁哥☆70Jan 18, 2025Updated last year
- ☆16Sep 4, 2025Updated 7 months ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- ☆18Mar 2, 2026Updated last month
- ☆14Apr 14, 2025Updated last year
- FlexiTokens☆19Dec 27, 2025Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆13Nov 1, 2025Updated 5 months ago
- ☆13Mar 5, 2025Updated last year
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 5 months ago
- CMU Linguistic Annotation Backend☆15Sep 22, 2025Updated 6 months ago
- ☆19Jun 10, 2025Updated 10 months ago
- Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach☆14Apr 2, 2025Updated last year
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Updated this week
- Symphony — A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi…☆33Oct 30, 2025Updated 5 months ago
- ☆14Apr 25, 2025Updated 11 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆13Updated this week
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆17Apr 2, 2025Updated last year
- ☆13Sep 12, 2024Updated last year
- ☆13Jan 22, 2025Updated last year
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆19Nov 4, 2025Updated 5 months ago
- ☆12Oct 7, 2024Updated last year
- The official implement of paper 《DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents》☆30Oct 23, 2025Updated 5 months ago