Experiments with representation engineering
☆13Feb 28, 2024Updated 2 years ago
Alternatives and similar repositories for repeng
Users that are interested in repeng are comparing it to the libraries listed below
Sorting:
- Official Code for our paper: "Language Models Learn to Mislead Humans via RLHF""☆19Oct 11, 2024Updated last year
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"☆19Dec 14, 2024Updated last year
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)☆20Jan 19, 2025Updated last year
- (Model-written) LLM evals library☆18Dec 13, 2024Updated last year
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"☆27Jun 4, 2024Updated last year
- A collection of different ways to implement accessing and modifying internal model activations for LLMs☆20Oct 18, 2024Updated last year
- Code to enable layer-level steering in LLMs using sparse auto encoders☆31Sep 18, 2025Updated 5 months ago
- Improving Steering Vectors by Targeting Sparse Autoencoder Features☆27Nov 20, 2024Updated last year
- [ACL 2024 main] Aligning Large Language Models with Human Preferences through Representation Engineering (https://aclanthology.org/2024.…☆28Sep 25, 2024Updated last year
- This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce…☆25Feb 16, 2026Updated 2 weeks ago
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks.☆34Oct 28, 2025Updated 4 months ago
- A library for efficient patching and automatic circuit discovery.☆90Dec 31, 2025Updated 2 months ago
- Auditing agents for fine-tuning safety☆20Oct 21, 2025Updated 4 months ago
- Project exploring 3D volumetric rendering of NEXRAD radar data.☆11Oct 23, 2023Updated 2 years ago
- Archive of questions from the Cambridge Mathematics Tripos☆10Jun 6, 2022Updated 3 years ago
- Deep Learning Part 2, 2019 edition - transcriptions, screenshots and notebooks☆11Jul 19, 2019Updated 6 years ago
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated 11 months ago
- ☆273Oct 1, 2024Updated last year
- MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing…☆10Oct 7, 2024Updated last year
- ☆12May 9, 2021Updated 4 years ago
- Course Info for VIP-GEAI☆11Apr 11, 2024Updated last year
- The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.☆13Dec 16, 2024Updated last year
- Some realistic tabular datasets for testing (CSV)☆21Mar 7, 2018Updated 7 years ago
- A library for training crosscoders☆16May 28, 2025Updated 9 months ago
- see github.com/understanding-search/maze-transformer☆10Dec 8, 2023Updated 2 years ago
- An analog touch screen joystick that pretends to be a bevy gamepad☆13Jul 13, 2024Updated last year
- The AI that helps you achieve your goals☆11Feb 4, 2024Updated 2 years ago
- Attempt to understand Percy Liang's Dependency-based Compositional Semantics by implementing it in Python☆10Mar 10, 2013Updated 12 years ago
- The official implement of paper S2-VER: Semi-Supervised Visual Emotion Recognition☆11Apr 28, 2024Updated last year
- Customizable charts made with TikZ and LaTeX3☆14Feb 11, 2023Updated 3 years ago
- ☆13Feb 18, 2026Updated last week
- ✒️ A gallery of experiments with Scalable Vector Graphics (SVG) and interactive visualizations.☆13Jan 6, 2023Updated 3 years ago
- Fast wavelet transforms on the sphere☆13Dec 20, 2016Updated 9 years ago
- A tool for visualization of complex job searches.☆13Jul 8, 2022Updated 3 years ago
- Automated terminal emulator benchmarks☆22Jan 14, 2026Updated last month
- Flight Recorder allows to record client program execution and examine it later☆11Sep 18, 2020Updated 5 years ago
- [ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models☆16Jun 18, 2025Updated 8 months ago
- Codebase for Mechanistic Mode Connectivity☆13Jul 14, 2023Updated 2 years ago
- Web Scraping monster.com using scrapy with JSON APIs☆10Oct 18, 2019Updated 6 years ago