An exploration of LLM steering
☆26Jun 15, 2024Updated last year
Alternatives and similar repositories for activation_steering
Users that are interested in activation_steering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS2024] Official code for (IMA) Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs☆23Oct 15, 2024Updated last year
- Coherence boosting: When your pretrained language model is not paying enough attention (ACL 2022) https://arxiv.org/abs/2110.08294☆15Apr 23, 2023Updated 3 years ago
- ☆46Feb 8, 2024Updated 2 years ago
- Codebase for public release of the plug-and-blend framework.☆23Mar 29, 2022Updated 4 years ago
- Reinforcement learning with Equinox☆20Mar 4, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [arXiv 2024] FairVision: Equitable Deep Learning for Eye Disease Screening via Fair Identity Scaling☆16Apr 15, 2026Updated last month
- Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning☆24Jun 25, 2025Updated 11 months ago
- Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and obse…☆19Jan 24, 2023Updated 3 years ago
- Implementing RASP transformer programming language https://arxiv.org/pdf/2106.06981.pdf.☆65Nov 1, 2025Updated 6 months ago
- 基于bert的文本情感分析☆12Nov 4, 2022Updated 3 years ago
- ☆14Feb 24, 2025Updated last year
- Embedded Rust Projects☆13Jun 12, 2024Updated last year
- ☆12Jun 29, 2024Updated last year
- FormulaNet is a new large-scale Mathematical Formula Detection dataset.☆21Nov 21, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆27May 2, 2025Updated last year
- Sentiment analysis has been a popular field in natural language processing. Sentiments can be expressed explicitly or implicitly. Most cur…☆16Nov 3, 2021Updated 4 years ago
- Batch downloader and Scraper for Pico-8 carts.☆18Aug 21, 2025Updated 9 months ago
- [USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns☆14Mar 1, 2025Updated last year
- A Keyboard Pad, supported 8 keys + 4 retray encoders or 12 keys.☆12Dec 16, 2023Updated 2 years ago
- ☆11Apr 29, 2020Updated 6 years ago
- [NAACL 2024] TabSQLify: Enhancing Reasoning Capabilities of LLMs Through Table Decomposition☆17Jan 5, 2026Updated 4 months ago
- Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks☆72May 7, 2026Updated 2 weeks ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 多模态情感分析☆17Jul 14, 2023Updated 2 years ago
- Parametric differentiable curves with PyTorch for continuous embeddings, shape-restricted models, or KANs☆64Apr 24, 2026Updated last month
- [ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"☆35Jun 23, 2025Updated 11 months ago
- Code of ICLR 2025 paper "DynaPrompt: Dynamic Test-Time Prompt Tuning"☆22Jan 29, 2025Updated last year
- GOPHI: an AMR-to-English Verbalizer☆11Feb 5, 2020Updated 6 years ago
- PaiNN in jax☆11Jan 14, 2025Updated last year
- Templates for paper submissions, technical questionnaires, etc.☆14Sep 13, 2024Updated last year
- Code for reproducing the results from "CrAM: A Compression-Aware Minimizer" accepted at ICLR 2023☆10Mar 1, 2023Updated 3 years ago
- ☆30May 4, 2023Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Benchmark AFLOW Data Sets for Machine Learning doi.org/10.1007/s40192-020-00174-4☆11Aug 29, 2020Updated 5 years ago
- An opinionated approach to have type safety in native JavaScript.☆10Jan 11, 2022Updated 4 years ago
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated 4 months ago
- 电动笛子!☆12Jul 13, 2024Updated last year
- MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering☆14May 3, 2024Updated 2 years ago
- [ICML 2025] Official Implementation of "Hessian Geometry of Latent Space in Generative Models"☆18Aug 16, 2025Updated 9 months ago
- Handwriting Analysis for Detection of Personality Traits☆17Feb 11, 2019Updated 7 years ago