DLR-SC/style-vectors-for-steering-llms

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/DLR-SC/style-vectors-for-steering-llms)

DLR-SC / style-vectors-for-steering-llms

Code release for the paper "Style Vectors for Steering Generative Large Language Models", accepted to the Findings of the EACL 2024.

☆37

Alternatives and similar repositories for style-vectors-for-steering-llms

Users that are interested in style-vectors-for-steering-llms are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

DanielSc4 / Dynamic-Activation-Composition
View on GitHub
Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"
☆14Nov 22, 2024Updated last year
wenlai-lavine / sNeuron-TST
View on GitHub
EMNLP 2024 | Style-Specific Neurons for Steering LLMs in Text Style Transfer
☆14Mar 23, 2025Updated last year
steering-vectors / steering-vectors
View on GitHub
Steering vectors for transformer language models in Pytorch / Huggingface
☆159Feb 21, 2025Updated last year
Ellenzzn / PersLLM
View on GitHub
☆16Jan 16, 2025Updated last year
chrisliu298 / awesome-representation-engineering
View on GitHub
A resource repository for representation engineering in large language models
☆156Nov 14, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
nrimsky / CAA
View on GitHub
Steering Llama 2 with Contrastive Activation Addition
☆241May 23, 2024Updated 2 years ago
lyh6560new / P3Sum
View on GitHub
The offical code for paper "What Constitutes a Faithful Summary? Preserving Author Perspectives in News Summarization"
☆10Jun 23, 2024Updated 2 years ago
KihoPark / linear_rep_geometry
View on GitHub
Code for 'The Linear Representation Hypothesis and the Geometry of Large Language Models' (ICML 2024)
☆125Feb 11, 2025Updated last year
bryanchrist / MathNeuro
View on GitHub
Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes
☆24Jun 15, 2025Updated last year
montemac / activation_additions
View on GitHub
Algebraic value editing in pretrained language models
☆71Nov 1, 2023Updated 2 years ago
yuzhaouoe / SAE-based-representation-engineering
View on GitHub
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆83Jun 20, 2026Updated last month
exercism / tcl
View on GitHub
Exercism exercises in Tcl.
☆11Updated this week
TaoZhen1110 / CAT-LLM
View on GitHub
☆35Oct 15, 2024Updated last year
EleutherAI / steering-llama3
View on GitHub
☆30Aug 2, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ntunlp / coherence-paradigm
View on GitHub
Code and Data for the ACL 2022 paper "Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling"
☆11Apr 5, 2022Updated 4 years ago
shuo-git / VecConstNMT
View on GitHub
☆25Oct 22, 2022Updated 3 years ago
mattf1n / basis-aware-threshold
View on GitHub
Code for the paper "Closing the Curious Case of Neural Text Degeneration"
☆12Apr 9, 2025Updated last year
nrimsky / LM-exp
View on GitHub
LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces
☆105Sep 21, 2023Updated 2 years ago
GraySwanAI / circuit-breakers
View on GitHub
Improving Alignment and Robustness with Circuit Breakers
☆266Sep 24, 2024Updated last year
qizhangli / Gradient-based-Jailbreak-Attacks
View on GitHub
Code for our NeurIPS 2024 paper Improved Generation of Adversarial Examples Against Safety-aligned LLMs
☆12Nov 7, 2024Updated last year
UKPLab / on-emergence
View on GitHub
Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning
☆33Jan 9, 2025Updated last year
jiaohuix / nmt_data_tools
View on GitHub
machine translation data process tools
☆10Apr 29, 2024Updated 2 years ago
sophie-xhonneux / Continuous-AdvTrain
View on GitHub
☆36Apr 13, 2026Updated 3 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
declare-lab / safety-arithmetic
View on GitHub
☆13Jan 14, 2025Updated last year
jzbjyb / lm-calibration
View on GitHub
☆34Nov 17, 2021Updated 4 years ago
joshterrell805 / OpenIntro_Statistics_Labs
View on GitHub
R labs for the book OpenIntro Statistics (https://www.openintro.org/stat/)
☆13Nov 17, 2016Updated 9 years ago
slavachalnev / SAE-TS
View on GitHub
Improving Steering Vectors by Targeting Sparse Autoencoder Features
☆29Nov 20, 2024Updated last year
ScalingIntelligence / Archon
View on GitHub
Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.
☆207Mar 7, 2025Updated last year
deeplearning-wisc / args
View on GitHub
☆47Feb 8, 2024Updated 2 years ago
pariajm / english-fisher-annotations
View on GitHub
A recipe for constituency parsing, disfluency tagging and obtaining the fluent transcripts of English Fisher dataset
☆13May 2, 2021Updated 5 years ago
zyxnlp / ICL-Interpretation-Analysis-Resources
View on GitHub
Links to publications that focus on the interpretation and analysis of in-context learning
☆14Oct 17, 2024Updated last year
open-compass / RePro
View on GitHub
[ICLR 2026] Rectifying LLM Thought From Lens of Optimization
☆15Dec 5, 2025Updated 7 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
li012589 / HMC_pytorch
View on GitHub
A pytorch version of hamiltonian monte carlo
☆15Jun 26, 2019Updated 7 years ago
vgel / repeng
View on GitHub
A library for making RepE control vectors
☆746Sep 24, 2025Updated 10 months ago
shengliu66 / ICV
View on GitHub
Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering
☆201Feb 13, 2025Updated last year
GXimingLu / a_star_neurologic
View on GitHub
☆43Mar 24, 2023Updated 3 years ago
CaoYuanpu / BiPO
View on GitHub
Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization
☆50Jul 28, 2024Updated 2 years ago
oshears / adv-ml-2020-snn-project
View on GitHub
Advanced Machine Learning Fall 2020 Project Repository
☆12Dec 12, 2020Updated 5 years ago
thevasudevgupta / transformers-adapters
View on GitHub
This repositary hosts my experiments for the project, I did with OffNote Labs.
☆10Apr 12, 2021Updated 5 years ago