Code release for the paper "Style Vectors for Steering Generative Large Language Models", accepted to the Findings of the EACL 2024.
☆36Sep 26, 2024Updated last year
Alternatives and similar repositories for style-vectors-for-steering-llms
Users that are interested in style-vectors-for-steering-llms are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- A curated list of personalized Language model / Large language model (continually updated)☆10Nov 17, 2023Updated 2 years ago
- Steering Llama 2 with Contrastive Activation Addition☆220May 23, 2024Updated last year
- The offical code for paper "What Constitutes a Faithful Summary? Preserving Author Perspectives in News Summarization"☆10Jun 23, 2024Updated last year
- A resource repository for representation engineering in large language models☆149Nov 14, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization☆43Jul 28, 2024Updated last year
- Algebraic value editing in pretrained language models☆69Nov 1, 2023Updated 2 years ago
- ☆119Feb 11, 2025Updated last year
- EMNLP 2024 | Style-Specific Neurons for Steering LLMs in Text Style Transfer☆13Mar 23, 2025Updated last year
- ☆32Oct 15, 2024Updated last year
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Jan 9, 2025Updated last year
- Code and Data for the ACL 2022 paper "Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling"☆11Apr 5, 2022Updated 3 years ago
- ☆22Aug 8, 2025Updated 7 months ago
- ☆30Aug 2, 2024Updated last year
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- [ICLR 2025] General-purpose activation steering library☆153Sep 18, 2025Updated 6 months ago
- Code for the paper "Closing the Curious Case of Neural Text Degeneration"☆12Apr 9, 2025Updated 11 months ago
- Code for our NeurIPS 2024 paper Improved Generation of Adversarial Examples Against Safety-aligned LLMs☆12Nov 7, 2024Updated last year
- LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces☆103Sep 21, 2023Updated 2 years ago
- Code to enable layer-level steering in LLMs using sparse auto encoders☆31Sep 18, 2025Updated 6 months ago
- The official repo for the DanQing dataset.☆33Updated this week
- Improving Steering Vectors by Targeting Sparse Autoencoder Features☆27Nov 20, 2024Updated last year
- Links to publications that focus on the interpretation and analysis of in-context learning☆15Oct 17, 2024Updated last year
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆198Feb 13, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A library for making RepE control vectors☆699Sep 24, 2025Updated 6 months ago
- R labs for the book OpenIntro Statistics (https://www.openintro.org/stat/)☆13Nov 17, 2016Updated 9 years ago
- Text generation using language models with multiple exit heads☆16Sep 18, 2025Updated 6 months ago
- A repo for LLM jailbreak☆14Sep 5, 2023Updated 2 years ago
- ☆16Sep 1, 2025Updated 6 months ago
- ☆18Jan 17, 2024Updated 2 years ago
- [NeurIPS 2024] Fight Back Against Jailbreaking via Prompt Adversarial Tuning☆11Oct 29, 2024Updated last year
- code for paper "Discerning and Resolving Knowledge Conflicts through Adaptive Decoding with Contextual Information-Entropy Constraint"☆12Sep 29, 2024Updated last year
- Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.☆18Jan 14, 2025Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- A Python/Cython package for graph edit distances and graph matching☆13Jan 30, 2023Updated 3 years ago
- coord2state.js maps latitude and longitude coordinates to their corresponding US state☆30Jun 18, 2025Updated 9 months ago
- ☆14Mar 16, 2023Updated 3 years ago
- Conversational Recommender System Evaluation via Simulation☆19Updated this week
- Learning about objects and their properties by interacting with them☆12Oct 21, 2020Updated 5 years ago
- Official implementation of the paper “Reconsidering Overthinking: Penalizing Internal and External Redundancy in CoT Reasoning”☆20Aug 20, 2025Updated 7 months ago
- A simplified port of LayoutParser for detecting layout elements on documents.☆13Jun 3, 2024Updated last year