corl-team/steering-reasoning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/corl-team/steering-reasoning)

corl-team / steering-reasoning

Official implementation of "Steering LLM Reasoning Through Bias-Only Adaptation" and "Small Vectors, Big Effects: A Mechanistic Study of RL-Induced Reasoning via Steering Vectors"

☆54

Alternatives and similar repositories for steering-reasoning

Users that are interested in steering-reasoning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

corl-team / flexsae
View on GitHub
Official Triton kernels for TopK and HierarchicalTopK Sparse Autoencoder decoders.
☆29Sep 29, 2025Updated 9 months ago
corl-team / counting_manifolds
View on GitHub
Code for the reproduction of counting manifolds
☆16Feb 26, 2026Updated 4 months ago
corl-team / katakomba
View on GitHub
Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)
☆43Aug 22, 2023Updated 2 years ago
cvenhoff / thinking-llms-interp
View on GitHub
☆25Jul 8, 2026Updated 2 weeks ago
AIRI-Institute / SAE-Reasoning
View on GitHub
☆99Mar 28, 2025Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
tinkoff-ai / palbert
View on GitHub
Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight
☆37Apr 8, 2023Updated 3 years ago
yandex-research / AsyncReasoning
View on GitHub
☆25Jun 25, 2026Updated last month
MadryLab / D3M
View on GitHub
Debiasing Through Data Attribution
☆13May 23, 2024Updated 2 years ago
jayneelparekh / learn-to-steer
View on GitHub
[NeurIPS 2025] Official Implementation for Learning to Steer: Input-dependent Steering for Multimodal LLMs
☆19Dec 14, 2025Updated 7 months ago
corl-team / ad-eps
View on GitHub
Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"
☆35Sep 18, 2024Updated last year
cvenhoff / steering-thinking-llms
View on GitHub
☆39Jul 9, 2025Updated last year
dunnolab / NinA
View on GitHub
Official implementation of "NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows"
☆17Sep 22, 2025Updated 10 months ago
RenlyH / CodeV
View on GitHub
[CVPR 2026 Oral] Code with Image
☆31Dec 5, 2025Updated 7 months ago
v-gen-ai / Marchuk
View on GitHub
Global Weather Forecasting from Mid-Range to Subseasonal Scale
☆19Mar 26, 2026Updated 3 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
dmbeaglehole / neural_controllers
View on GitHub
Code for steering and monitoring with concepts vectors in LLMs (initial draft)
☆31Aug 10, 2025Updated 11 months ago
d-tiapkin / gfnx
View on GitHub
Environments and Algorithms for Generative Flow Networks in JAX
☆90May 10, 2026Updated 2 months ago
tinkoff-ai / cnf
View on GitHub
Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…
☆12Jan 31, 2023Updated 3 years ago
corl-team / CORL
View on GitHub
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…
☆652Feb 10, 2024Updated 2 years ago
AIRI-Institute / vintix
View on GitHub
Vintix: Action Model via In-Context Reinforcement Learning - - —
☆23May 23, 2025Updated last year
GreatDrake / non-acyclic-gfn
View on GitHub
Repository for "Revisiting Non-Acyclic GFlowNets in Discrete Environments" (ICML 2025)
☆14Oct 8, 2025Updated 9 months ago
mmp-practicum-team / mmp_dl_spring
View on GitHub
Курс "Введение в глубокое обучение" для бакалавров 3 курса кафедры ММП ВМК МГУ, весенний семестр
☆34Jul 3, 2026Updated 3 weeks ago
JackKuo666 / a_numpy_based_implement_cnn
View on GitHub
这是我的博客《不用框架，使用Python搭建基于numpy的卷积神经网络来进行cifar-10分类的深度学习系统》的代码实现。
☆10Jul 1, 2019Updated 7 years ago
yizhu-joy / DataFilter
View on GitHub
☆15Nov 29, 2025Updated 7 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
amezqui3 / demeter
View on GitHub
Compute the Euler Characteristic Transform
☆17Oct 18, 2023Updated 2 years ago
TencentARC / TaCA
View on GitHub
Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".
☆16Jun 20, 2023Updated 3 years ago
Miaow-Lab / RLVR-Linearity
View on GitHub
[arXiv] "Linear Dynamics in the RLVR Training of Large Language Models"
☆17May 25, 2026Updated 2 months ago
angie-chen55 / pref-learning-ranking-acc
View on GitHub
☆13Jun 4, 2024Updated 2 years ago
RakitinDen / HSE-Diffusion-Models
View on GitHub
Course on diffusion models and modern generative modeling foundations taught at the Faculty of Computer Science of HSE University
☆46Jun 27, 2026Updated 3 weeks ago
equilicore / see-and-tell
View on GitHub
See and Tell | Assistant to Experience Video Content With Ears
☆21Oct 3, 2023Updated 2 years ago
liangyupu / DIMTDA
View on GitHub
The official repository of "Document Image Machine Translation with Dynamic Multi-pre-trained Models Assembling"
☆14Nov 26, 2025Updated 7 months ago
mochuishle / Thesis-Review-Skill
View on GitHub
Review your thesis from the perspective of a reviewer. Distilling Years of Experience from a University Instructor.
☆15May 3, 2026Updated 2 months ago
LzVv123456 / VISTA
View on GitHub
☆86Jul 28, 2025Updated 11 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
stanfordnlp / axbench
View on GitHub
Stanford NLP Python library for benchmarking the utility of LLM interpretability methods
☆210Mar 12, 2026Updated 4 months ago
fallnlove / TwoToInfinity
View on GitHub
[AAAI-2026] Respository for the paper "Matrix-Free Two-to-Infinity and One-to-Two Norms Estimation"
☆18Mar 20, 2026Updated 4 months ago
THU-KEG / SafetyNeuron
View on GitHub
Data and code for the paper: Finding Safety Neurons in Large Language Models
☆29Jan 29, 2026Updated 5 months ago
deepvk / emospeech
View on GitHub
☆129Aug 19, 2024Updated last year
hwanchang00 / ChatInject
View on GitHub
[ICLR 2026] Official implementation of "ChatInject: Abusing Chat Templates for Prompt Injection in LLM Agents"
☆17Mar 23, 2026Updated 4 months ago
IamAnkita-create / Auto-Speech-model-for-an-image-for-blind-people
View on GitHub
People who suffer from low vision, sight and visual impairment are not able to see words and letters in ordinary newsprint, books and mag…
☆10Oct 1, 2020Updated 5 years ago
SALT-NLP / CODA
View on GitHub
Simple Conversational Data Augmentation for Semi-supervised Abstractive Conversation Summarization
☆10Mar 7, 2022Updated 4 years ago