Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"
☆14Nov 22, 2024Updated last year
Alternatives and similar repositories for Dynamic-Activation-Composition
Users that are interested in Dynamic-Activation-Composition are comparing it to the libraries listed below
Sorting:
- Code release for the paper "Style Vectors for Steering Generative Large Language Models", accepted to the Findings of the EACL 2024.☆36Sep 26, 2024Updated last year
- ☆21Aug 8, 2025Updated 7 months ago
- Interpretating the latent space representations of attention head outputs for LLMs☆39Aug 13, 2024Updated last year
- ☆13Oct 5, 2025Updated 5 months ago
- The offical code for paper "What Constitutes a Faithful Summary? Preserving Author Perspectives in News Summarization"☆10Jun 23, 2024Updated last year
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆77Jan 16, 2026Updated 2 months ago
- Code to enable layer-level steering in LLMs using sparse auto encoders☆31Sep 18, 2025Updated 6 months ago
- The 🌟ANITA project🌟 *(Advanced Natural-based interaction for the ITAlian language)* wants to provide Italian NLP researchers with an im…☆24Sep 11, 2024Updated last year
- Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization☆43Jul 28, 2024Updated last year
- Steering Llama 2 with Contrastive Activation Addition☆214May 23, 2024Updated last year
- GHC-WPC is an extended GHC that exports the STG and other IR (.modpak) for the compiled modules and linker metadata (.ghc_stgapp) at appl…☆10Feb 14, 2024Updated 2 years ago
- A collection of Italian benchmarks for LLM evaluation☆37Dec 2, 2025Updated 3 months ago
- Steering vectors for transformer language models in Pytorch / Huggingface☆140Feb 21, 2025Updated last year
- First two front-end phases of a language-independent compiler implemented in C++☆14Jul 27, 2020Updated 5 years ago
- The official repo for the Dialz Python library - a toolkit for steering vector research.☆22Jul 9, 2025Updated 8 months ago
- CS194-196 Course Project☆14Feb 20, 2025Updated last year
- 🗿 gigachadify any face☆24Jul 7, 2024Updated last year
- ☆119Feb 11, 2025Updated last year
- LAGr: Label Aligned Graphs for Better Systematic Generalization in Semantic Parsing☆10Jun 1, 2022Updated 3 years ago
- Interface for generating Categorical Query Language files to merge/migrate data between databases☆14Aug 9, 2021Updated 4 years ago
- Code from the Functional Scala Conference talk: Type inference for plain-text SQL☆10Dec 5, 2024Updated last year
- Code for our NeurIPS 2024 paper Improved Generation of Adversarial Examples Against Safety-aligned LLMs☆12Nov 7, 2024Updated last year
- A library for writing chemical and biological data management systems☆10Oct 24, 2019Updated 6 years ago
- ☆12Jul 28, 2024Updated last year
- A library for efficient patching and automatic circuit discovery.☆92Dec 31, 2025Updated 2 months ago
- Vision-Language based Visual Object Tracking☆28Oct 10, 2025Updated 5 months ago
- A benchmark for mechanistic discovery of circuits in Transformers☆16Dec 15, 2024Updated last year
- 构建一个医疗领域知识图谱和一个基于Flask的简易网页聊天机器人,通过ner获取用户问题的实体并在知识图谱内提取答案。☆12Apr 25, 2023Updated 2 years ago
- PRODeep: A Platform for Robustness Verification of Deep Neural Networks☆12Nov 11, 2020Updated 5 years ago
- ☆15Aug 15, 2025Updated 7 months ago
- Improving Steering Vectors by Targeting Sparse Autoencoder Features☆27Nov 20, 2024Updated last year
- Implementation of my agent used in 2025 AFAC TianChi competition☆28Oct 6, 2025Updated 5 months ago
- Links to publications that focus on the interpretation and analysis of in-context learning☆15Oct 17, 2024Updated last year
- A Compiler made for the Decaf Language using Flex, Bison and LLVM IR for generating code dump and interpreting.☆11Nov 30, 2018Updated 7 years ago
- A semi-automated system based on LLM's to generate ontologies from datasets☆24Oct 29, 2024Updated last year
- A curated list of resources for activation engineering☆132Oct 2, 2025Updated 5 months ago
- R labs for the book OpenIntro Statistics (https://www.openintro.org/stat/)☆13Nov 17, 2016Updated 9 years ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated 10 months ago
- Code for the paper "Spectral Editing of Activations for Large Language Model Alignments"☆29Dec 20, 2024Updated last year