Code release for the paper "Style Vectors for Steering Generative Large Language Models", accepted to the Findings of the EACL 2024.
☆36Sep 26, 2024Updated last year
Alternatives and similar repositories for style-vectors-for-steering-llms
Users that are interested in style-vectors-for-steering-llms are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- A curated list of personalized Language model / Large language model (continually updated)☆10Nov 17, 2023Updated 2 years ago
- Steering Llama 2 with Contrastive Activation Addition☆228May 23, 2024Updated last year
- The offical code for paper "What Constitutes a Faithful Summary? Preserving Author Perspectives in News Summarization"☆10Jun 23, 2024Updated last year
- A resource repository for representation engineering in large language models☆151Nov 14, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization☆45Jul 28, 2024Updated last year
- Algebraic value editing in pretrained language models☆70Nov 1, 2023Updated 2 years ago
- EMNLP 2024 | Style-Specific Neurons for Steering LLMs in Text Style Transfer☆13Mar 23, 2025Updated last year
- ☆34Oct 15, 2024Updated last year
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Jan 9, 2025Updated last year
- Code and Data for the ACL 2022 paper "Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling"☆11Apr 5, 2022Updated 4 years ago
- ☆24Aug 8, 2025Updated 9 months ago
- ☆30Aug 2, 2024Updated last year
- [ICLR 2025] General-purpose activation steering library☆167Sep 18, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for the paper "Closing the Curious Case of Neural Text Degeneration"☆12Apr 9, 2025Updated last year
- Code for our NeurIPS 2024 paper Improved Generation of Adversarial Examples Against Safety-aligned LLMs☆12Nov 7, 2024Updated last year
- LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces☆104Sep 21, 2023Updated 2 years ago
- Code to enable layer-level steering in LLMs using sparse auto encoders☆31Sep 18, 2025Updated 7 months ago
- The official repo for the DanQing dataset.☆35Mar 25, 2026Updated last month
- Improving Steering Vectors by Targeting Sparse Autoencoder Features☆27Nov 20, 2024Updated last year
- Links to publications that focus on the interpretation and analysis of in-context learning☆15Oct 17, 2024Updated last year
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆198Feb 13, 2025Updated last year
- A library for making RepE control vectors☆723Sep 24, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- R labs for the book OpenIntro Statistics (https://www.openintro.org/stat/)☆13Nov 17, 2016Updated 9 years ago
- Text generation using language models with multiple exit heads☆16Sep 18, 2025Updated 7 months ago
- A repo for LLM jailbreak☆14Sep 5, 2023Updated 2 years ago
- ☆18Sep 1, 2025Updated 8 months ago
- A graph visualization of attention☆56May 20, 2025Updated 11 months ago
- ☆18Jan 17, 2024Updated 2 years ago
- LLM手撕代码合集☆21Mar 25, 2025Updated last year
- Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.☆18Jan 14, 2025Updated last year
- Analyzing LLM Alignment via Token distribution shift☆18Jan 26, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- 🧮 Algebraic Positional Encodings.☆20Aug 20, 2025Updated 8 months ago
- Conversational Recommender System Evaluation via Simulation☆19Apr 21, 2026Updated 2 weeks ago
- Learning about objects and their properties by interacting with them☆12Oct 21, 2020Updated 5 years ago
- Official implementation of the paper “Reconsidering Overthinking: Penalizing Internal and External Redundancy in CoT Reasoning”☆20Aug 20, 2025Updated 8 months ago
- A simplified port of LayoutParser for detecting layout elements on documents.☆14Jun 3, 2024Updated last year
- Code for ACL 2021 paper "Unsupervised Out-of-Domain Detection via Pre-trained Transformers"☆30Aug 20, 2021Updated 4 years ago
- Improving Alignment and Robustness with Circuit Breakers☆261Sep 24, 2024Updated last year