duykhuongnguyen/MAT-Steer

☆21

Alternatives and similar repositories for MAT-Steer

Users who are interested in MAT-Steer are comparing it to the repositories listed below.

DanielSc4 / Dynamic-Activation-Composition
Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"
☆14 · Nov 22, 2024 · Updated last year
xypan0 / G-DIG
☆12 · Jun 30, 2024 · Updated 2 years ago
tim-lawson / mlsae
Multi-Layer Sparse Autoencoders (ICLR 2025)
☆30 · Feb 6, 2026 · Updated 5 months ago
MikaStars39 / FeatureAlignment
FeatureAlignment = Alignment + Mechanistic Interpretability
☆35 · Mar 8, 2025 · Updated last year
HuyNguyen-hust / flash-attn-101
☆22 · Sep 3, 2024 · Updated last year
shengliu66 / FractionalReason
Official github repo for "Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute"
☆17 · Jun 30, 2025 · Updated last year
bino282 / ViNLP
☆17 · Oct 30, 2022 · Updated 3 years ago
fiveai / understanding_safety_finetuning
Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)
☆12 · Oct 31, 2024 · Updated last year
CaoYuanpu / BiPO
Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization
☆50 · Jul 28, 2024 · Updated last year
jorispos / ConceptorSteering
☆16 · Mar 13, 2025 · Updated last year
Butanium / tiny-activation-dashboard
A tiny easily hackable implementation of a feature dashboard.
☆17 · Oct 21, 2025 · Updated 9 months ago
efarrell1 / train_sparse_autoencoder
Trains Sparse Autoencoders based on outputs from language models
☆11 · Oct 7, 2024 · Updated last year
kyleliang919 / Online-Subspace-Descent
[NeurIPS 2024] Low rank memory efficient optimizer without SVD
☆33 · Jul 1, 2025 · Updated last year
khainb / CSW
A novel variant of sliced Wasserstein based on a new slicing technique that utilizes the convolution operator.
☆12 · Jan 14, 2023 · Updated 3 years ago
FlyingPumba / InterpBench
A benchmark for mechanistic discovery of circuits in Transformers
☆17 · Dec 15, 2024 · Updated last year
Call-for-Code / UnityStarterKit
This is a sample project for getting started with Unity and data visualization.
☆11 · Jun 5, 2020 · Updated 6 years ago
smartyfh / DST-ASSIST
ASSIST: Towards Label Noise-Robust Dialogue State Tracking
☆10 · Apr 11, 2022 · Updated 4 years ago
Alrope123 / prompt-waywardness
☆14 · Apr 27, 2022 · Updated 4 years ago
lauhaide / clads
XWikisCorpus, cross-lingual summarisation, multi-lingual summarisation, pre-trained language models, zero-shot and few-shot summarisation…
☆10 · Nov 4, 2022 · Updated 3 years ago
ZhangShiyue / extractive_is_not_faithful
☆17 · May 19, 2023 · Updated 3 years ago
ckkissane / crosscoder-model-diff-replication
Open source replication of Anthropic's Crosscoders for Model Diffing
☆68 · Oct 27, 2024 · Updated last year
naver-ai / cs-shortcut
Saving Dense Retriever from Shortcut Dependency in Conversational Search (EMNLP 2022)
☆18 · Nov 24, 2022 · Updated 3 years ago
JasonGross / guarantees-based-mechanistic-interpretability
☆18 · Updated this week
qingyue2014 / MoE4DST
☆12 · Jul 18, 2023 · Updated 3 years ago
Chengyuann / AutoStyle-TTS
Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-…
☆26 · Feb 1, 2026 · Updated 5 months ago
SoYoungCho / Korean-English-NMT
Neural Machine Translation model for Capstone Project
☆11 · Apr 11, 2020 · Updated 6 years ago
hexiaoxiao-cs / DICE
☆16 · May 10, 2026 · Updated 2 months ago
MaxTorop / DISCO
Code for "DISCO: Disentangled Communication Steering for Large Language Models" (NeurIPS 2025) https://openreview.net/pdf?id=c8AjdgdHnD
☆16 · Oct 29, 2025 · Updated 8 months ago
yumoxu / oreo
☆13 · Sep 27, 2022 · Updated 3 years ago
criticalml-uw / TamperBench
Toolkit to benchmark the tamper-resistance of LLMs.
☆27 · May 15, 2026 · Updated 2 months ago
multimodal-art-projection / I-SHEEP
I-SHEEP: Iterative Self-enHancEmEnt Paradigm of LLMs through Self-Instruct and Self-Assessment
☆17 · Jan 16, 2025 · Updated last year
stanfordnlp / axbench
Stanford NLP Python library for benchmarking the utility of LLM interpretability methods
☆210 · Mar 12, 2026 · Updated 4 months ago
The-Inscrutable-X / TACQ
Official Repository for Task-Circuit Quantization
☆28 · Jun 1, 2025 · Updated last year
sar-mo / CS2051-HonorsDiscreteMath
A collection of resources for CS 2051, an undergraduate Honors Discrete Mathematics course at Georgia Tech.
☆11 · Apr 29, 2026 · Updated 2 months ago
MANGA-UOFA / Prompt-Edit
An official implementation for the EMNLP 2023 Findings paper "Prompt-Based Editing for Text Style Transfer"
☆13 · Dec 9, 2023 · Updated 2 years ago
jayneelparekh / learn-to-steer
[NeurIPS 2025] Official Implementation for Learning to Steer: Input-dependent Steering for Multimodal LLMs
☆19 · Dec 14, 2025 · Updated 7 months ago
Hung-Thinh-Innovation / VFastPunct
Fast Punctuation Restoration using Transformer Models for Vietnamese
☆11 · Jun 10, 2022 · Updated 4 years ago
korokes / MCLS
Assist Non-native Viewers: Multimodal Crosslingual Summarization for How2 Videos
☆10 · Sep 2, 2024 · Updated last year
Confirm-Solutions / dreamy
Fluent dreaming for language models
☆13 · Jul 22, 2024 · Updated last year