MaxBelitsky / cache-steeringLinks
KV Cache Steering for Inducing Reasoning in Small Language Models
☆44Updated 6 months ago
Alternatives and similar repositories for cache-steering
Users that are interested in cache-steering are comparing it to the libraries listed below
Sorting:
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆61Updated last year
- ☆55Updated last year
- Lottery Ticket Adaptation☆39Updated last year
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆37Updated 3 months ago
- ☆82Updated 2 months ago
- A repository for research on medium sized language models.☆77Updated last year
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆62Updated last year
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆58Updated last week
- ☆91Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 11 months ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆94Updated last year
- ☆29Updated 2 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆118Updated last year
- Official implementation of the ICML 2024 paper RoSA (Robust Adaptation)☆44Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.☆112Updated 9 months ago
- MEXMA: Token-level objectives improve sentence representations☆42Updated last year
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆260Updated last week
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆63Updated 3 months ago
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …☆60Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆80Updated last year
- Code for NeurIPS LLM Efficiency Challenge☆60Updated last year
- Fork of Flame repo for training of some new stuff in development☆19Updated 3 weeks ago
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆41Updated 2 years ago
- UQ: Assessing Language Models on Unsolved Questions☆30Updated 5 months ago
- ☆48Updated last year
- ☆59Updated 2 months ago
- Resa: Transparent Reasoning Models via SAEs☆47Updated 4 months ago
- Train, tune, and infer Bamba model☆138Updated 7 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 3 months ago