[NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios
☆16Oct 18, 2024Updated last year
Alternatives and similar repositories for CSKV
Users that are interested in CSKV are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A plug-and-play tool for visualizing attention-score heatmap in generative LLMs. Easy to customize for your own need.☆52May 16, 2024Updated 2 years ago
- An image of ROS with Nvidia cudagl to enable GUI☆14Apr 18, 2020Updated 6 years ago
- This Repository has a project which explains how to generate a face point cloud in Three js using Tensorflow Js☆23Jun 9, 2022Updated 4 years ago
- ☆18Apr 28, 2022Updated 4 years ago
- Evaluation utilities based on SymPy.☆22Dec 12, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 基于PYNQ Z2开发板与Vivado 2022.2的FPGA开发板使用教程☆35Mar 24, 2023Updated 3 years ago
- Code Repository of Evaluating Quantized Large Language Models☆135Sep 8, 2024Updated last year
- [ICML24] Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for LLMs☆100Nov 25, 2024Updated last year
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated last year
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- 清华大学第六届人工智能挑战赛电子系赛道(原电子系第 24 届队式程序设计大赛 teamstyle24)☆28May 11, 2024Updated 2 years ago
- Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference☆258Feb 3, 2026Updated 4 months ago
- A Framework for Evaluating AI Agent Safety in Realistic Environments☆38Oct 2, 2025Updated 8 months ago
- ☆10Mar 13, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆13Jun 13, 2025Updated last year
- ☆13Jul 10, 2024Updated last year
- Activation-aware Singular Value Decomposition for Compressing Large Language Models☆91Oct 22, 2024Updated last year
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated last year
- 基于Qwen-2.5-1.5B 进行DPO fine-tuning后,意外说真话的AI暴躁哥☆72Jan 18, 2025Updated last year
- ☆16Sep 4, 2025Updated 9 months ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- ☆18Mar 2, 2026Updated 3 months ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆13Mar 5, 2025Updated last year
- ☆28Aug 9, 2025Updated 10 months ago
- CMU Linguistic Annotation Backend☆15Sep 22, 2025Updated 9 months ago
- ☆20Jun 10, 2025Updated last year
- Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach☆17Apr 2, 2025Updated last year
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆13Jun 21, 2026Updated last week
- ☆15Apr 25, 2025Updated last year
- ☆13Apr 13, 2026Updated 2 months ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆18Apr 2, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆13Sep 12, 2024Updated last year
- We propose Bidirectional Evolutionary Search (BES), a search framework that couples forward candidate evolution with backward goal decomp…☆160May 28, 2026Updated last month
- ☆14Jan 22, 2025Updated last year
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆20Nov 4, 2025Updated 7 months ago
- The official implement of paper 《DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents》☆31Oct 23, 2025Updated 8 months ago
- ☆11Jun 22, 2025Updated last year
- ☆16Apr 30, 2024Updated 2 years ago