microsoft/llm-steer-instruct

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/microsoft/llm-steer-instruct)

microsoft / llm-steer-instruct

A method for steering llms to better follow instructions

☆96

Alternatives and similar repositories for llm-steer-instruct

Users that are interested in llm-steer-instruct are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yuzhaouoe / SAE-based-representation-engineering
View on GitHub
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆83Jun 20, 2026Updated last month
Sreyan88 / ACLM
View on GitHub
Code for ACL 2023 Paper: ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER
☆22Jul 19, 2023Updated 3 years ago
jayneelparekh / learn-to-steer
View on GitHub
[NeurIPS 2025] Official Implementation for Learning to Steer: Input-dependent Steering for Multimodal LLMs
☆19Dec 14, 2025Updated 7 months ago
IBM / sae-steering
View on GitHub
Code to enable layer-level steering in LLMs using sparse auto encoders
☆34Sep 18, 2025Updated 10 months ago
armingh2000 / FactScoreLite
View on GitHub
FactScoreLite is an implementation of the FactScore metric, designed for detailed accuracy assessment in text generation. This package bu…
☆14Apr 25, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
duykhuongnguyen / MAT-Steer
View on GitHub
☆21Aug 19, 2025Updated 11 months ago
lyh6560new / P3Sum
View on GitHub
The offical code for paper "What Constitutes a Faithful Summary? Preserving Author Perspectives in News Summarization"
☆10Jun 23, 2024Updated 2 years ago
Evanwu1125 / LiteCoT
View on GitHub
☆17Jun 10, 2025Updated last year
rycolab / bayesian-mi
View on GitHub
This code accompanies the paper "Bayesian Framework for Information-Theoretic Probing" published in EMNLP 2021.
☆10Aug 23, 2021Updated 4 years ago
bpwu1 / confidence-regulation-neurons
View on GitHub
Confidence Regulation Neurons in Language Models (NeurIPS 2024)
☆15Feb 1, 2025Updated last year
QwenLM / PolyMath
View on GitHub
[NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"
☆43May 22, 2025Updated last year
microsoft / PLOG
View on GitHub
☆23Jun 7, 2023Updated 3 years ago
microsoft / eval-recipes
View on GitHub
☆28Apr 21, 2026Updated 3 months ago
alestolfo / causal-math
View on GitHub
Code Repository for "A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models".
☆15Oct 14, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
microsoft / nlu-incremental-symbol-learning
View on GitHub
incremental symbol learning for natural language understanding
☆10Jun 12, 2023Updated 3 years ago
jinghuichen / FedCAMS
View on GitHub
Github Repo for ICML 2022 paper: Communication-Efficient Adaptive Federated Learning
☆10Nov 18, 2022Updated 3 years ago
kaistAI / Knowledge-Entropy
View on GitHub
[ICLR 2025 Oral] Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition
☆17Nov 25, 2024Updated last year
swarnaHub / SummarizationPrograms
View on GitHub
[ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees
☆23Jun 19, 2023Updated 3 years ago
nrimsky / CAA
View on GitHub
Steering Llama 2 with Contrastive Activation Addition
☆241May 23, 2024Updated 2 years ago
alexkahn / iterable-api
View on GitHub
A Python wrapper for the Iterable API
☆12Jan 7, 2026Updated 6 months ago
AAAAAAsuka / llm_defends
View on GitHub
code of paper "Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM"
☆14Nov 17, 2023Updated 2 years ago
DLR-SC / style-vectors-for-steering-llms
View on GitHub
Code release for the paper "Style Vectors for Steering Generative Large Language Models", accepted to the Findings of the EACL 2024.
☆37Sep 26, 2024Updated last year
THUNLP-MT / L2Copy4APE
View on GitHub
Learning to Copy for Automatic Post-Editing (EMNLP 2019)
☆11May 6, 2021Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
microsoft / ReMe
View on GitHub
ReMe: A Personalized Cognitive Training Framework Based on an LLM Voice Chatbot for Research
☆18Jul 3, 2025Updated last year
Shrutii07 / Suduku-Solver-Generator
View on GitHub
Python code to generate any number of sudoku puzzles and their corresponding solvers.
☆10Dec 27, 2020Updated 5 years ago
edenbiran / RippleEdits
View on GitHub
Evaluating the Ripple Effects of Knowledge Editing in Language Models
☆57Apr 15, 2024Updated 2 years ago
Hritikbansal / jpo
View on GitHub
☆13Jul 2, 2025Updated last year
microsoft / jrt-controller
View on GitHub
Real time controller for network functions instrumented with the jbpf framework
☆18Apr 24, 2026Updated 3 months ago
microsoft / dstoolkit-genai-shap
View on GitHub
SHAP (SHapley Additive exPlanations) for Generative AI (LLMs and SMLs) based solutions.
☆19Jul 4, 2025Updated last year
Y-Sui / FiDeLiS
View on GitHub
Code for Paper ACL'25: FiDELIS: Faithful Reasoning of Large Language Model on Knowledge Graph Question Answering
☆21May 8, 2025Updated last year
swarnaHub / ExplanationIntervention
View on GitHub
[NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind
☆66Dec 21, 2023Updated 2 years ago
aws-samples / aws-iam-identity-center-permission-policies-analyzer
View on GitHub
☆17Mar 10, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
CaoYuanpu / BiPO
View on GitHub
Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization
☆50Jul 28, 2024Updated last year
alohays / openai-tool2mcp
View on GitHub
mcp wrapper for openai built-in tools
☆12Mar 13, 2025Updated last year
Shrutii07 / 8051-Programming
View on GitHub
Assembly and C codes to interface various components and communication protocols for 8051-microcontroller
☆10Apr 27, 2021Updated 5 years ago
sleeepeer / PISanitizer
View on GitHub
PISanitizer: Preventing Prompt Injection to Long-Context LLMs via Prompt Sanitization
☆18Dec 10, 2025Updated 7 months ago
safety-research / persona_vectors
View on GitHub
Persona Vectors: Monitoring and Controlling Character Traits in Language Models
☆452Apr 22, 2026Updated 3 months ago
ruizheliUOA / Awesome-Interpretability-in-Large-Language-Models
View on GitHub
This repository collects all relevant resources about interpretability in LLMs
☆402Nov 1, 2024Updated last year
lingxiao-li / HAE
View on GitHub
ICCV 2023: The Euclidean Space is Evil: Hyperbolic Attribute Editing for Few-shot Image Generation
☆15Sep 29, 2023Updated 2 years ago