[NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios
☆16Oct 18, 2024Updated last year
Alternatives and similar repositories for CSKV
Users that are interested in CSKV are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A plug-and-play tool for visualizing attention-score heatmap in generative LLMs. Easy to customize for your own need.☆51May 16, 2024Updated last year
- An image of ROS with Nvidia cudagl to enable GUI☆14Apr 18, 2020Updated 5 years ago
- This Repository has a project which explains how to generate a face point cloud in Three js using Tensorflow Js☆23Jun 9, 2022Updated 3 years ago
- ☆18Apr 28, 2022Updated 3 years ago
- Evaluation utilities based on SymPy.☆22Dec 12, 2024Updated last year
- 基于PYNQ Z2开发板与Vivado 2022.2的FPGA开发板使用教程☆30Mar 24, 2023Updated 2 years ago
- Code Repository of Evaluating Quantized Large Language Models☆135Sep 8, 2024Updated last year
- [ICML24] Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for LLMs☆98Nov 25, 2024Updated last year
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- 清华大学第六届人工智能挑战赛电子系赛道(原电子 系第 24 届队式程序设计大赛 teamstyle24)☆28May 11, 2024Updated last year
- A Framework for Evaluating AI Agent Safety in Realistic Environments☆30Oct 2, 2025Updated 5 months ago
- ☆19Jun 10, 2025Updated 9 months ago
- Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference☆246Feb 3, 2026Updated last month
- ☆10Mar 13, 2023Updated 3 years ago
- ☆13Jul 10, 2024Updated last year
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 9 months ago
- Activation-aware Singular Value Decomposition for Compressing Large Language Models☆90Oct 22, 2024Updated last year
- 基于Qwen-2.5-1.5B 进行DPO fine-tuning后,意外说真话的AI暴躁哥☆70Jan 18, 2025Updated last year
- ☆16Sep 4, 2025Updated 6 months ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- ☆14Apr 14, 2025Updated 11 months ago
- FlexiTokens☆18Dec 27, 2025Updated 2 months ago
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆12Nov 1, 2025Updated 4 months ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆17Nov 4, 2025Updated 4 months ago
- ☆13Mar 5, 2025Updated last year
- CMU Linguistic Annotation Backend☆15Sep 22, 2025Updated 6 months ago
- Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach☆14Apr 2, 2025Updated 11 months ago
- Symphony — A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi…☆32Oct 30, 2025Updated 4 months ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Sep 11, 2025Updated 6 months ago
- ☆14Apr 25, 2025Updated 10 months ago
- ☆13Aug 7, 2025Updated 7 months ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆17Apr 2, 2025Updated 11 months ago
- ☆13Sep 12, 2024Updated last year
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆19Nov 4, 2025Updated 4 months ago
- ☆13Jan 22, 2025Updated last year
- ☆12Oct 7, 2024Updated last year
- The official implement of paper 《DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents》☆29Oct 23, 2025Updated 5 months ago
- ☆11Jun 22, 2025Updated 9 months ago