Code release for the paper "Style Vectors for Steering Generative Large Language Models", accepted to the Findings of the EACL 2024.
☆36Sep 26, 2024Updated last year
Alternatives and similar repositories for style-vectors-for-steering-llms
Users that are interested in style-vectors-for-steering-llms are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- Steering vectors for transformer language models in Pytorch / Huggingface☆152Feb 21, 2025Updated last year
- Steering Llama 2 with Contrastive Activation Addition☆236May 23, 2024Updated 2 years ago
- A resource repository for representation engineering in large language models☆153Nov 14, 2024Updated last year
- Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization☆47Jul 28, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Algebraic value editing in pretrained language models☆70Nov 1, 2023Updated 2 years ago
- ☆123Feb 11, 2025Updated last year
- Exercism exercises in Tcl.☆11Jun 3, 2026Updated 2 weeks ago
- ☆34Oct 15, 2024Updated last year
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Jan 9, 2025Updated last year
- Code and Data for the ACL 2022 paper "Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling"☆11Apr 5, 2022Updated 4 years ago
- ☆27Aug 8, 2025Updated 10 months ago
- ☆30Aug 2, 2024Updated last year
- Code for the paper "Closing the Curious Case of Neural Text Degeneration"☆12Apr 9, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICLR 2025] General-purpose activation steering library☆175Sep 18, 2025Updated 9 months ago
- LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces☆103Sep 21, 2023Updated 2 years ago
- Code to enable layer-level steering in LLMs using sparse auto encoders☆33Sep 18, 2025Updated 9 months ago
- Improving Steering Vectors by Targeting Sparse Autoencoder Features☆27Nov 20, 2024Updated last year
- Links to publications that focus on the interpretation and analysis of in-context learning☆15Oct 17, 2024Updated last year
- A recipe for constituency parsing, disfluency tagging and obtaining the fluent transcripts of English Fisher dataset☆13May 2, 2021Updated 5 years ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆198Feb 13, 2025Updated last year
- A library for making RepE control vectors☆731Sep 24, 2025Updated 8 months ago
- R labs for the book OpenIntro Statistics (https://www.openintro.org/stat/)☆13Nov 17, 2016Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Text generation using language models with multiple exit heads☆16Sep 18, 2025Updated 9 months ago
- A repo for LLM jailbreak☆14Sep 5, 2023Updated 2 years ago
- ☆19Sep 1, 2025Updated 9 months ago
- A graph visualization of attention☆56May 20, 2025Updated last year
- [NeurIPS 2024] Fight Back Against Jailbreaking via Prompt Adversarial Tuning☆11Oct 29, 2024Updated last year
- code for paper "Discerning and Resolving Knowledge Conflicts through Adaptive Decoding with Contextual Information-Entropy Constraint"☆11Sep 29, 2024Updated last year
- Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.☆18Jan 14, 2025Updated last year
- Analyzing LLM Alignment via Token distribution shift☆18Jan 26, 2024Updated 2 years ago
- ☆23May 20, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A Python/Cython package for graph edit distances and graph matching☆13Jan 30, 2023Updated 3 years ago
- Conversational Recommender System Evaluation via Simulation☆19Jun 9, 2026Updated last week
- Reconsidering Overthinking: Penalizing Internal and External Redundancy in CoT Reasoning☆21Aug 20, 2025Updated 9 months ago
- A simplified port of LayoutParser for detecting layout elements on documents.☆14Jun 3, 2024Updated 2 years ago
- Improving Alignment and Robustness with Circuit Breakers☆263Sep 24, 2024Updated last year
- Code for NeurIPS 2024 Paper "Fight Back Against Jailbreaking via Prompt Adversarial Tuning"☆22May 6, 2025Updated last year
- ☆14Mar 16, 2023Updated 3 years ago