MaxBelitsky/cache-steering

KV Cache Steering for Inducing Reasoning in Small Language Models

☆50

Alternatives and similar repositories for cache-steering

Users who are interested in cache-steering are comparing it to the libraries listed below.

dkopi / Bitune
Implementation of Bitune: Bidirectional Instruction-Tuning
☆27 · Jun 19, 2025 · Updated last year
vpariza / open-hummingbird-eval
This is a repository that implements the Dense NN Retrieval Evaluation used for evaluating the In-Context Learning Capabilities of Vision…
☆32 · Nov 3, 2025 · Updated 8 months ago
SMSD75 / Timetuning
Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations ICCV23
☆30 · Dec 30, 2024 · Updated last year
Qualcomm-AI-research / llm-surgeon
☆35 · May 24, 2024 · Updated 2 years ago
vpariza / NeCo
"Near, far: Patch-ordering enhances vision foundation models' scene understanding": A New SSL Post-Training Approach for Improving DINOv2…
☆33 · Apr 20, 2025 · Updated last year
zzbright1998 / SentenceKV
Official implementation of "SentenceKV: Efficient LLM Inference via Sentence-Level Semantic KV Caching" (COLM 2025). A novel KV cache com…
☆15 · Sep 29, 2025 · Updated 9 months ago
floatingsun / transformer_layers_as_painters
transformer layers behavior as painters🧑‍🎨
☆15 · May 6, 2025 · Updated last year
AgenticIR-Lab / OThink-R1
This is the official code for OThink-R1 project.
☆21 · Jun 19, 2025 · Updated last year
lukasknobel / SelfCollages
Learning to Count without Annotations
☆23 · May 24, 2024 · Updated 2 years ago
Evanwu1125 / LiteCoT
☆17 · Jun 10, 2025 · Updated last year
abrvkh / explainability_toolkit
☆14 · Dec 12, 2024 · Updated last year
mandyyyyii / east
☆19 · Aug 4, 2025 · Updated 11 months ago
ahans30 / goldfish-loss
[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs
☆98 · Nov 17, 2024 · Updated last year
Akshit21112002 / TTRV
TTRV: Test-Time Reinforcement Learning for Vision–Language Models (CVPR 2026)
☆46 · Mar 8, 2026 · Updated 4 months ago
hangeol / UniR
Official repo for paper: Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs
☆20 · Nov 26, 2025 · Updated 7 months ago
ozyyshr / RAST
Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)
☆22 · Oct 16, 2025 · Updated 9 months ago
apple / ml-epicache
☆30 · Oct 2, 2025 · Updated 9 months ago
intervention-training / int
☆16 · Feb 4, 2026 · Updated 5 months ago
seungwonpark / awesome-model-cards
Resources related to the model cards for ML
☆11 · Mar 16, 2021 · Updated 5 years ago
sgvaze / clevr4
Starter notebook and utilities for the Clevr-4 dataset
☆17 · Nov 1, 2023 · Updated 2 years ago
zaydzuhri / flame
Fork of Flame repo for training of some new stuff in development
☆20 · Jul 15, 2026 · Updated last week
Adaxry / Unified_Layer_Skipping
☆15 · Apr 11, 2024 · Updated 2 years ago
microsoft / dataflow2text
Code for "The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation with Dataflow Transduction an…
☆10 · Apr 30, 2024 · Updated 2 years ago
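JackKuo666 / a_numpy_based_implement_cnn
This is the code implementation for my blog post "Building a deep learning system for CIFAR-10 classification with a NumPy-based convolutional neural network in Python, without a framework".
☆10 · Jul 1, 2019 · Updated 7 years ago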
Xingyu-Zheng / FOEM
(AAAI 2026) First-Order Error Matters: Accurate Compensation for Quantized Large Language Models
☆16 · Apr 16, 2026 · Updated 3 months ago
MasterVito / DAC-RL
Official Repo for DAC-RL: Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability
☆16 · Feb 26, 2026 · Updated 4 months ago
Hritikbansal / jpo
☆13 · Jul 2, 2025 · Updated last year
probcomp / genlm-control
☆13 · Apr 17, 2025 · Updated last year
JiwooKimAR / dmath
☆12 · Feb 16, 2024 · Updated 2 years ago
songmzhang / DSKDv2
The official implementation of the paper "A Dual-Space Framework for General Knowledge Distillation of Large Language Models".
☆18 · Jan 4, 2026 · Updated 6 months ago
liangyupu / DIMTDA
The official repository of "Document Image Machine Translation with Dynamic Multi-pre-trained Models Assembling"
☆14 · Nov 26, 2025 · Updated 7 months ago
1KE-JI / UPFT
Official resources of "The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reaso…
☆20 · Jun 13, 2025 · Updated last year
iLearn-Lab / ACL25-PTQ1.61
☆15 · Apr 6, 2026 · Updated 3 months ago
Stanford-AIMI / LieRE
[ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs.
☆14 · Aug 8, 2025 · Updated 11 months ago
wenquanlu / huginn-latent-cot
[COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-…
☆19 · Oct 4, 2025 · Updated 9 months ago
BaohaoLiao / SAGE
Self-Hinting Language Models Enhance Reinforcement Learning
☆26 · Mar 28, 2026 · Updated 3 months ago
Hoar012 / TDC-Video
Official implementation of TDC.
☆15 · Jul 22, 2025 · Updated last year
aastroza / structured-generation-benchmark
Structured Generation Evals
☆14 · Sep 25, 2024 · Updated last year
matthias-wright / cifar10-resnet
PyTorch implementation of a 9-layer ResNet for CIFAR-10.
☆11 · May 8, 2024 · Updated 2 years ago