DanielSc4/Dynamic-Activation-Composition

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/DanielSc4/Dynamic-Activation-Composition)

DanielSc4 / Dynamic-Activation-Composition

Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"

☆14

Alternatives and similar repositories for Dynamic-Activation-Composition

Users that are interested in Dynamic-Activation-Composition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

DLR-SC / style-vectors-for-steering-llms
View on GitHub
Code release for the paper "Style Vectors for Steering Generative Large Language Models", accepted to the Findings of the EACL 2024.
☆37Sep 26, 2024Updated last year
msakarvadia / AttentionLens
View on GitHub
Interpretating the latent space representations of attention head outputs for LLMs
☆39Aug 13, 2024Updated last year
CaoYuanpu / BiPO
View on GitHub
Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization
☆50Jul 28, 2024Updated last year
ordavid-s / snmf-mlp-decomposition
View on GitHub
☆15Jul 7, 2026Updated last week
lyh6560new / P3Sum
View on GitHub
The offical code for paper "What Constitutes a Faithful Summary? Preserving Author Perspectives in News Summarization"
☆10Jun 23, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
yuzhaouoe / SAE-based-representation-engineering
View on GitHub
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆83Jun 20, 2026Updated 3 weeks ago
bryanchrist / MathNeuro
View on GitHub
Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes
☆23Jun 15, 2025Updated last year
marcopoli / LLaMAntino-3-ANITA
View on GitHub
The 🌟ANITA project🌟 *(Advanced Natural-based interaction for the ITAlian language)* wants to provide Italian NLP researchers with an im…
☆24Sep 11, 2024Updated last year
nrimsky / CAA
View on GitHub
Steering Llama 2 with Contrastive Activation Addition
☆240May 23, 2024Updated 2 years ago
grin-compiler / ghc-wpc
View on GitHub
GHC-WPC is an extended GHC that exports the STG and other IR (.modpak) for the compiled modules and linker metadata (.ghc_stgapp) at appl…
☆10Feb 14, 2024Updated 2 years ago
SapienzaNLP / ita-bench
View on GitHub
A collection of Italian benchmarks for LLM evaluation
☆37Jun 9, 2026Updated last month
microsoft / llm-steer-instruct
View on GitHub
A method for steering llms to better follow instructions
☆96Jun 10, 2026Updated last month
duykhuongnguyen / MAT-Steer
View on GitHub
☆21Aug 19, 2025Updated 11 months ago
staymylove / COT_Compresstion_via_Step_entropy
View on GitHub
☆27Aug 8, 2025Updated 11 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
steering-vectors / steering-vectors
View on GitHub
Steering vectors for transformer language models in Pytorch / Huggingface
☆157Feb 21, 2025Updated last year
IBM / sae-steering
View on GitHub
Code to enable layer-level steering in LLMs using sparse auto encoders
☆34Sep 18, 2025Updated 10 months ago
Elzawawy / compiler-frontend
View on GitHub
First two front-end phases of a language-independent compiler implemented in C++
☆14Jul 27, 2020Updated 5 years ago
thesofakillers / gigachadify
View on GitHub
🗿 gigachadify any face
☆24Jul 7, 2024Updated 2 years ago
ALT-JS / OthelloSAE
View on GitHub
CS194-196 Course Project
☆14Feb 20, 2025Updated last year
KihoPark / linear_rep_geometry
View on GitHub
Code for 'The Linear Representation Hypothesis and the Geometry of Large Language Models' (ICML 2024)
☆125Feb 11, 2025Updated last year
ElementAI / lagr
View on GitHub
LAGr: Label Aligned Graphs for Better Systematic Generalization in Semantic Parsing
☆10Jun 1, 2022Updated 4 years ago
qizhangli / Gradient-based-Jailbreak-Attacks
View on GitHub
Code for our NeurIPS 2024 paper Improved Generation of Adversarial Examples Against Safety-aligned LLMs
☆12Nov 7, 2024Updated last year
kris-brown / cql_data_integration
View on GitHub
Interface for generating Categorical Query Language files to merge/migrate data between databases
☆14Aug 9, 2021Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
KuceraMartin / Type-inference-for-plain-text-SQL
View on GitHub
Code from the Functional Scala Conference talk: Type inference for plain-text SQL
☆10Dec 5, 2024Updated last year
declare-lab / safety-arithmetic
View on GitHub
☆13Jan 14, 2025Updated last year
stefan-hoeck / cyby2
View on GitHub
A library for writing chemical and biological data management systems
☆10Oct 24, 2019Updated 6 years ago
joshterrell805 / OpenIntro_Statistics_Labs
View on GitHub
R labs for the book OpenIntro Statistics (https://www.openintro.org/stat/)
☆13Nov 17, 2016Updated 9 years ago
tofu-tf / shalter
View on GitHub
☆12Jul 28, 2024Updated last year
weixuan-wang123 / SADI
View on GitHub
☆19Sep 1, 2025Updated 10 months ago
slavachalnev / SAE-TS
View on GitHub
Improving Steering Vectors by Targeting Sparse Autoencoder Features
☆28Nov 20, 2024Updated last year
UFO-101 / auto-circuit
View on GitHub
A library for efficient patching and automatic circuit discovery.
☆99Dec 31, 2025Updated 6 months ago
FlyingPumba / InterpBench
View on GitHub
A benchmark for mechanistic discovery of circuits in Transformers
☆17Dec 15, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
zyxnlp / ICL-Interpretation-Analysis-Resources
View on GitHub
Links to publications that focus on the interpretation and analysis of in-context learning
☆14Oct 17, 2024Updated last year
open-compass / RePro
View on GitHub
[ICLR 2026] Rectifying LLM Thought From Lens of Optimization
☆15Dec 5, 2025Updated 7 months ago
lsds / MultiKernelBOSS
View on GitHub
☆15Aug 15, 2025Updated 11 months ago
iscas-tis / PRODeep
View on GitHub
PRODeep: A Platform for Robustness Verification of Deep Neural Networks
☆12Nov 11, 2020Updated 5 years ago
honeyandme / knowledge
View on GitHub
构建一个医疗领域知识图谱和一个基于Flask的简易网页聊天机器人，通过ner获取用户问题的实体并在知识图谱内提取答案。
☆12Apr 25, 2023Updated 3 years ago
deeplearning-wisc / LUMINA
View on GitHub
Official implementation of ICLR 2026 paper "LUMINA: Detecting Hallucinations in RAG System with Context–Knowledge Signals"
☆18Jan 31, 2026Updated 5 months ago
xiongjiechen / Normalizing-Flows-DPFs
View on GitHub
☆10Jan 3, 2025Updated last year