wln20/CSKV

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wln20/CSKV)

wln20 / CSKV

[NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios

☆16

Alternatives and similar repositories for CSKV

Users that are interested in CSKV are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wln20 / Attention-Viewer
View on GitHub
A plug-and-play tool for visualizing attention-score heatmap in generative LLMs. Easy to customize for your own need.
☆52May 16, 2024Updated 2 years ago
HoraN1 / docker-ros-gui
View on GitHub
An image of ROS with Nvidia cudagl to enable GUI
☆14Apr 18, 2020Updated 6 years ago
techtee-ltd / TensorFlow_Threejs_FaceMesh
View on GitHub
This Repository has a project which explains how to generate a face point cloud in Three js using Tensorflow Js
☆23Jun 9, 2022Updated 4 years ago
LightR0 / hugging_face_tutorials
View on GitHub
☆18Apr 28, 2022Updated 4 years ago
tongyx361 / symeval
View on GitHub
Evaluation utilities based on SymPy.
☆22Dec 12, 2024Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
thu-nics / qllm-eval
View on GitHub
Code Repository of Evaluating Quantized Large Language Models
☆135Sep 8, 2024Updated last year
pprp / Pruner-Zero
View on GitHub
[ICML24] Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for LLMs
☆100Nov 25, 2024Updated last year
Wangmerlyn / KeepGPU
View on GitHub
KeepGPU is a simple CLI app that keeps your GPUs running.
☆39Jul 4, 2026Updated 2 weeks ago
WangHaoZhe / PYNQ-Tutorial
View on GitHub
基于PYNQ Z2开发板与Vivado 2022.2的FPGA开发板使用教程
☆36Mar 24, 2023Updated 3 years ago
eesast / THUAI6
View on GitHub
清华大学第六届人工智能挑战赛电子系赛道（原电子系第 24 届队式程序设计大赛 teamstyle24）
☆29May 11, 2024Updated 2 years ago
Yarayx / livelongbench
View on GitHub
The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…
☆12Jun 28, 2025Updated last year
akhilkedia / TranformersGetStable
View on GitHub
[ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"
☆11Jul 19, 2024Updated 2 years ago
mlzoo / BaoZaoAI
View on GitHub
基于Qwen-2.5-1.5B 进行DPO fine-tuning后，意外说真话的AI暴躁哥
☆72Jan 18, 2025Updated last year
hahnyuan / ASVD4LLM
View on GitHub
Activation-aware Singular Value Decomposition for Compressing Large Language Models
☆92Oct 22, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
aypan17 / reward-misspecification
View on GitHub
☆10Mar 13, 2023Updated 3 years ago
snap-research / VIMI
View on GitHub
☆13Jul 10, 2024Updated 2 years ago
cocoshe / MIMIGenRec
View on GitHub
A Flexible Framework for Generative Recommendation
☆47Apr 9, 2026Updated 3 months ago
CUC-MIPG / UnifyEdit
View on GitHub
Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model
☆13Dec 29, 2024Updated last year
GAIR-NLP / ReasonEval
View on GitHub
[AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy
☆80Oct 9, 2025Updated 9 months ago
SWE-Gym / SWE-Bench-Fork
View on GitHub
☆13Mar 5, 2025Updated last year
arubique / OCCAM
View on GitHub
This is an implementation of the paper "Are We Done with Object-Centric Learning?"
☆13Jun 21, 2026Updated last month
neulab / cmulab
View on GitHub
CMU Linguistic Annotation Backend
☆15Sep 22, 2025Updated 9 months ago
Vinoground / Vinoground
View on GitHub
☆13Apr 13, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
NVlabs / FRAG
View on GitHub
☆15Apr 25, 2025Updated last year
jylei16 / Imagine-e
View on GitHub
☆14Jan 22, 2025Updated last year
rsshyam / GRPO-bandits
View on GitHub
☆13Sep 12, 2024Updated last year
haodonga / CIFAR-Dataset-master
View on GitHub
☆105Jul 15, 2021Updated 5 years ago
smallporridge / TrustworthyRAG
View on GitHub
☆16May 18, 2026Updated 2 months ago
PacktPublishing / Mastering-AI-Agents-for-Databases
View on GitHub
☆14Dec 15, 2025Updated 7 months ago
wtybest / EnMMDiT
View on GitHub
[TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation
☆15Mar 7, 2026Updated 4 months ago
lin-jinwei / OneTo3D
View on GitHub
OneTo3D: One Image to Editable Dynamic 3D Model and Video Generation
☆15May 15, 2024Updated 2 years ago
taconite / MetaAvatar-release
View on GitHub
MetaAvatar: Learning Animatable Clothed Human Models from Few Depth Images
☆119Dec 6, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
chanchimin / AgentMonitor
View on GitHub
Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"
☆13Dec 13, 2024Updated last year
Leey21 / CipherBank
View on GitHub
☆13Jun 13, 2025Updated last year
TAU-VAILab / HaLo-NeRF
View on GitHub
☆16Apr 30, 2024Updated 2 years ago
IBM / ColPret
View on GitHub
Efficient Scaling laws and collaborative pretraining.
☆22Updated this week
TIGER-AI-Lab / VideoEval-Pro
View on GitHub
VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26]
☆15Jun 1, 2026Updated last month
lt-asset / Waffle
View on GitHub
For ACL25 paper "WAFFLE: Multi-Modal Model for Automated Front-End Development" - by Shanchao Liang and Nan Jiang and Shangshu Qian and L…
☆12May 28, 2025Updated last year
UNITES-Lab / C2R-MoE
View on GitHub
[NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…
☆16Feb 4, 2025Updated last year