MaxBelitsky / cache-steering
KV Cache Steering for Inducing Reasoning in Small Language Models
☆28Updated this week
Alternatives and similar repositories for cache-steering
Users who are interested in cache-steering are comparing it to the libraries listed below.
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated 10 months ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆64Updated 11 months ago
- ☆23Updated last month
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 5 months ago
- Aioli: A unified optimization framework for language model data mixing☆27Updated 6 months ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆29Updated 3 months ago
- Code repository for the paper "MrT5: Dynamic Token Merging for Efficient Byte-level Language Models."☆43Updated 3 months ago
- Official implementation of ECCV24 paper: POA☆24Updated 11 months ago
- A repository for research on medium sized language models.☆77Updated last year
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆55Updated 2 weeks ago
- ☆66Updated 3 months ago
- ☆45Updated last month
- Code for the paper "Self-Training Elicits Concise Reasoning in Large Language Models"☆38Updated 2 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆35Updated last year
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"☆25Updated 3 months ago
- Resa: Transparent Reasoning Models via SAEs☆40Updated last month
- ☆53Updated 8 months ago
- ☆47Updated 9 months ago
- Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆19Updated last month
- ☆24Updated 10 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆39Updated 4 months ago
- Lottery Ticket Adaptation☆39Updated 7 months ago
- Official Repository for Task-Circuit Quantization☆20Updated last month
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆20Updated this week
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆59Updated 7 months ago
- Verifiers for LLM Reinforcement Learning☆65Updated 3 months ago
- ☆33Updated 2 months ago
- ☆13Updated 7 months ago
- MEXMA: Token-level objectives improve sentence representations☆41Updated 6 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆33Updated 4 months ago