MaxBelitsky / cache-steeringLinks
KV Cache Steering for Inducing Reasoning in Small Language Models
☆44Updated 5 months ago
Alternatives and similar repositories for cache-steering
Users that are interested in cache-steering are comparing it to the libraries listed below
Sorting:
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆62Updated last year
- ☆82Updated last month
- A repository for research on medium sized language models.☆77Updated last year
- Lottery Ticket Adaptation☆40Updated last year
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆37Updated 3 months ago
- ☆55Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 11 months ago
- ☆91Updated last year
- Aioli: A unified optimization framework for language model data mixing☆32Updated 11 months ago
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆62Updated last year
- ☆48Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆80Updated last year
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆58Updated 2 weeks ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆66Updated last year
- Resa: Transparent Reasoning Models via SAEs☆47Updated 3 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆118Updated last year
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆94Updated last year
- Code for NeurIPS LLM Efficiency Challenge☆59Updated last year
- When Reasoning Meets Its Laws☆33Updated last week
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …☆60Updated last year
- Verifiers for LLM Reinforcement Learning☆80Updated 8 months ago
- MatFormer repo☆67Updated last year
- The first dense retrieval model that can be prompted like an LM☆89Updated 8 months ago
- Code repository for the paper "MrT5: Dynamic Token Merging for Efficient Byte-level Language Models."☆51Updated 3 months ago
- ☆26Updated 2 years ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆91Updated last year
- ☆21Updated 5 months ago
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆62Updated 3 months ago
- ☆52Updated last year
- Official implementation of the ICML 2024 paper RoSA (Robust Adaptation)☆44Updated last year