Lingkai-Kong/RE-Control

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Lingkai-Kong/RE-Control)

Lingkai-Kong / RE-Control

Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective

☆35

Alternatives and similar repositories for RE-Control

Users that are interested in RE-Control are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LiuAmber / RAHF
View on GitHub
[ACL 2024 main] Aligning Large Language Models with Human Preferences through Representation Engineering (https://aclanthology.org/2024.…
☆28Sep 25, 2024Updated last year
deeplearning-wisc / args
View on GitHub
☆47Feb 8, 2024Updated 2 years ago
night-chen / DyGen
View on GitHub
[KDD'23] This is the code repo for our KDD'23 paper "DyGen: Learning from Noisy Labels via Dynamics-Enhanced Generative Modeling".
☆11Jun 14, 2023Updated 3 years ago
sfeucht / footprints
View on GitHub
https://footprints.baulab.info
☆17Oct 4, 2024Updated last year
mlwu22 / RED
View on GitHub
Implementation code for ACL2024：Advancing Parameter Efficiency in Fine-tuning via Representation Editing
☆15Apr 20, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
weakrules / Denoise-multi-weak-sources
View on GitHub
☆19Jun 4, 2020Updated 6 years ago
likenneth / honest_llama
View on GitHub
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
☆581Jan 28, 2025Updated last year
facebookresearch / SOC-matching
View on GitHub
Code release for "Stochastic Optimal Control Matching"
☆43Aug 14, 2024Updated last year
haotiansun14 / BBox-Adapter
View on GitHub
Lightweight Adapting for Black-Box Large Language Models
☆25Feb 15, 2024Updated 2 years ago
ritaranx / BMRetriever
View on GitHub
[EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".
☆26Sep 19, 2024Updated last year
MANGA-UOFA / PTfer
View on GitHub
☆11Nov 13, 2024Updated last year
chrisliu298 / awesome-representation-engineering
View on GitHub
A resource repository for representation engineering in large language models
☆156Nov 14, 2024Updated last year
Lingkai-Kong / Calibrated-BERT-Fine-Tuning
View on GitHub
Code for Paper: Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data
☆36Nov 16, 2020Updated 5 years ago
LYang-666 / TRGP
View on GitHub
[ICLR 2022] Official Code Repository for "TRGP: TRUST REGION GRADIENT PROJECTION FOR CONTINUAL LEARNING"
☆22Oct 5, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
qingjiesjtu / USC
View on GitHub
This is the code repository of our submission: Understanding the Dark Side of LLMs’ Intrinsic Self-Correction.
☆61Dec 20, 2024Updated last year
chentong0 / copy-bench
View on GitHub
CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation
☆14Aug 19, 2025Updated 11 months ago
ai-resilience / yonsei_ml3_CoPe
View on GitHub
☆15Dec 16, 2025Updated 7 months ago
ai-resilience / yonsei_ml3_PriME
View on GitHub
☆15Dec 16, 2025Updated 7 months ago
zoom-wang112358 / MOLLEO
View on GitHub
Source code of MOLLEO
☆57Jul 8, 2025Updated last year
lasgroup / SafetyPolytope
View on GitHub
Learning Safety Constraints for Large Language Models (ICML2025)
☆35May 25, 2026Updated last month
simonasnow / MultilingualPerspectivistNLU
View on GitHub
☆10May 30, 2024Updated 2 years ago
zepingyu0512 / neuron-attribution
View on GitHub
code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models
☆52Nov 17, 2024Updated last year
StefanHeng / Symbolic-Music-Generation
View on GitHub
Symbolic music generation taking inspiration from NLP and human composition process
☆18Jun 28, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
yuzhaouoe / SAE-based-representation-engineering
View on GitHub
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆83Jun 20, 2026Updated last month
ernoult / targetProp
View on GitHub
Testing Difference Target Propagation (DTP) on MNIST.
☆13Oct 12, 2020Updated 5 years ago
FatemehShiri / Spatial-MM
View on GitHub
☆12Jan 10, 2025Updated last year
bond005 / impartial_text_cls
View on GitHub
Text classifier, based on the BERT and a Bayesian neural network, which can train on small labeled texts and doubt its decision.
☆14Mar 24, 2023Updated 3 years ago
garywei944 / ChemFlow
View on GitHub
Uncover meaningful structures of latent spaces learned by generative models with flows!
☆47May 10, 2024Updated 2 years ago
Linzwcs / AFT
View on GitHub
☆13Jan 22, 2025Updated last year
dsam99 / QueRE
View on GitHub
Code repository for the paper on "Predicting the Performance of Black-Box LLMs through Self-Queries".
☆12Jan 9, 2025Updated last year
ictnlp / TruthX
View on GitHub
Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"
☆144Mar 26, 2024Updated 2 years ago
llyx97 / FETV-EVAL
View on GitHub
Evaluation of generated videos on the FETV benchmark
☆10Apr 6, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
KingJamesSong / latent-flow
View on GitHub
NeurIPS23 "Flow Factorized Representation Learning"
☆46Dec 15, 2025Updated 7 months ago
yangzhou6666 / Beyond-Accurate-Code-Models
View on GitHub
A collection of publications that works on code models but beyond focusing on the accuracies.
☆12Jun 30, 2023Updated 3 years ago
Frostielocks / CAN-Fuzzer
View on GitHub
A fuzzer for the CAN bus
☆18Mar 1, 2025Updated last year
poseidonchan / ProteinWatermark
View on GitHub
Injecting watermarks to protein sequences for privacy protection in biosecurity
☆10Oct 1, 2024Updated last year
ezeli / bottom_up_features_extract
View on GitHub
An PyTorch reimplementation of bottom-up-attention models
☆16Jan 5, 2021Updated 5 years ago
ZhenlanJi / DL_CC
View on GitHub
CC: Causality-Aware Coverage Criterion for Deep Neural Networks
☆12Feb 15, 2023Updated 3 years ago
EdwinKim3069 / WaRP-CIFSL
View on GitHub
Official implementation for paper titled "Warping the Space: Weight Space Rotation for Class-Incremental Few-Shot Learning" accepted at I…
☆21Jun 18, 2024Updated 2 years ago