Code release for the paper "Style Vectors for Steering Generative Large Language Models", accepted to the Findings of the EACL 2024.
☆36Sep 26, 2024Updated last year
Alternatives and similar repositories for style-vectors-for-steering-llms
Users that are interested in style-vectors-for-steering-llms are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- Steering vectors for transformer language models in Pytorch / Huggingface☆149Feb 21, 2025Updated last year
- ☆15Jan 16, 2025Updated last year
- A curated list of personalized Language model / Large language model (continually updated)☆10Nov 17, 2023Updated 2 years ago
- Steering Llama 2 with Contrastive Activation Addition☆232May 23, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization☆45Jul 28, 2024Updated last year
- Algebraic value editing in pretrained language models☆70Nov 1, 2023Updated 2 years ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Jan 9, 2025Updated last year
- Code and Data for the ACL 2022 paper "Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling"☆11Apr 5, 2022Updated 4 years ago
- ☆26Aug 8, 2025Updated 9 months ago
- LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces☆103Sep 21, 2023Updated 2 years ago
- Improving Steering Vectors by Targeting Sparse Autoencoder Features☆27Nov 20, 2024Updated last year
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆198Feb 13, 2025Updated last year
- A library for making RepE control vectors☆727Sep 24, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Another dynamically-typed, lightweight programming language☆12May 5, 2015Updated 11 years ago
- R labs for the book OpenIntro Statistics (https://www.openintro.org/stat/)☆13Nov 17, 2016Updated 9 years ago
- JIT compiler of scheme☆15Jul 29, 2011Updated 14 years ago
- Text generation using language models with multiple exit heads☆16Sep 18, 2025Updated 8 months ago
- A repo for LLM jailbreak☆14Sep 5, 2023Updated 2 years ago
- A graph visualization of attention☆56May 20, 2025Updated last year
- ☆18Jan 17, 2024Updated 2 years ago
- [NeurIPS 2024] Fight Back Against Jailbreaking via Prompt Adversarial Tuning☆11Oct 29, 2024Updated last year
- code for paper "Discerning and Resolving Knowledge Conflicts through Adaptive Decoding with Contextual Information-Entropy Constraint"☆12Sep 29, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- LLM手撕代码合集☆21Mar 25, 2025Updated last year
- Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.☆18Jan 14, 2025Updated last year
- An IRC bot for Common Lisp code evaluation☆25Sep 12, 2025Updated 8 months ago
- ☆23May 20, 2025Updated last year
- 🧮 Algebraic Positional Encodings.☆20Aug 20, 2025Updated 9 months ago
- Learning about objects and their properties by interacting with them☆12Oct 21, 2020Updated 5 years ago
- A simplified port of LayoutParser for detecting layout elements on documents.☆14Jun 3, 2024Updated last year
- Code for NeurIPS 2024 Paper "Fight Back Against Jailbreaking via Prompt Adversarial Tuning"☆22May 6, 2025Updated last year
- A collection of different ways to implement accessing and modifying internal model activations for LLMs☆24Oct 18, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Own solutions for exercises in "Database System Concepts - 7th edition" by Silberschatz, Korth and Sudarshan.☆21Jun 11, 2024Updated last year
- [NeurIPS 2024] Source code for our paper "Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models".☆13Jul 18, 2025Updated 10 months ago
- This repository is the official implementation of the Hybrid Self-Attention NEAT algorithm. It contains the code to reproduce the results…☆14Jun 19, 2023Updated 2 years ago
- Data split, feature extraction with count vectorizer☆16Feb 27, 2018Updated 8 years ago
- Code for TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings, EMNLP 2024 Findings☆29Nov 18, 2024Updated last year
- Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Sp…☆21Mar 7, 2024Updated 2 years ago
- a WIP architecture designed to allow transformers to think in a manner without tokens☆20Apr 12, 2024Updated 2 years ago