shangshang-wang/Resa

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shangshang-wang/Resa)

shangshang-wang / Resa

Resa: Transparent Reasoning Models via SAEs

☆50

Alternatives and similar repositories for Resa

Users that are interested in Resa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AIRI-Institute / SAE-Reasoning
View on GitHub
☆99Mar 28, 2025Updated last year
esteng / regal_program_learning
View on GitHub
☆27Sep 11, 2024Updated last year
thunlp / SparsingLaw
View on GitHub
The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".
☆32Nov 12, 2024Updated last year
ssfgunner / VL-SAE
View on GitHub
[NeurIPS 2025] This is the official repository for VL-SAE: Interpreting and Enhancing Vision-Language Alignment with a Unified Concept Se…
☆15Oct 29, 2025Updated 8 months ago
shenao-zhang / reward-augmented-preference
View on GitHub
The official implementation of Preference Data Reward-Augmentation.
☆18May 1, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
sparkle-reasoning / sparkle
View on GitHub
[NeurIPS'25] Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning
☆16Dec 12, 2025Updated 7 months ago
LLM360 / TxT360
View on GitHub
☆25Dec 18, 2024Updated last year
bigai-nlco / RuleReasoner
View on GitHub
[ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling
☆39Feb 25, 2026Updated 4 months ago
foreverlasting1202 / QuestA
View on GitHub
☆22Jan 2, 2026Updated 6 months ago
belindal / state-tracking
View on GitHub
Code and data for paper "(How) do Language Models Track State?"
☆26Mar 31, 2025Updated last year
HypherX / Evolution-Analysis
View on GitHub
☆25Dec 13, 2024Updated last year
Miaow-Lab / RLVR-Linearity
View on GitHub
[arXiv] "Linear Dynamics in the RLVR Training of Large Language Models"
☆17May 25, 2026Updated last month
UMass-Embodied-AGI / BudgetGuidance
View on GitHub
[ACL'26 Findings] Steering LLM Thinking with Budget Guidance
☆32Feb 19, 2026Updated 5 months ago
wutaiqiang / MI
View on GitHub
Official code for paper "Revisiting Model Interpolation for Efficient Reasoning"
☆17Jul 14, 2026Updated last week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
s-sahoo / Eso-LMs
View on GitHub
[ICML 2026] Esoteric Language Models
☆121Jul 13, 2026Updated last week
uservan / ThinkPO
View on GitHub
☆17Aug 1, 2025Updated 11 months ago
gouki510 / Topology_of_Reasoning
View on GitHub
☆42Jun 11, 2025Updated last year
VITA-Group / SEAL
View on GitHub
[COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free
☆60Apr 6, 2025Updated last year
Lossfunk / KernelBench-v2
View on GitHub
KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems
☆24Jul 4, 2025Updated last year
Aries-iai / Manifold_Steering
View on GitHub
The official implementation for "Mitigating Overthinking in Large Reasoning Models via Manifold Steering"
☆15May 29, 2025Updated last year
sail-sg / SkyLadder
View on GitHub
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆43Dec 29, 2025Updated 6 months ago
jinhangzhan / RL_Heals_SFT
View on GitHub
☆21Mar 22, 2026Updated 3 months ago
GAIR-NLP / OctoThinker
View on GitHub
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆189Jul 23, 2025Updated 11 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
drarijitdas / Natural-GaLore
View on GitHub
An extention to the GaLore paper, to perform Natural Gradient Descent in low rank subspace
☆19Oct 21, 2024Updated last year
cvenhoff / steering-thinking-llms
View on GitHub
☆38Jul 9, 2025Updated last year
hkust-nlp / RL-Verifier-Robustness
View on GitHub
From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.
☆24Oct 7, 2025Updated 9 months ago
zqOuO / GWT
View on GitHub
☆13May 4, 2026Updated 2 months ago
Table-R1 / Table-R1
View on GitHub
[EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"
☆32Jun 3, 2025Updated last year
stallone0000 / Reasoning-Skill
View on GitHub
☆20May 25, 2026Updated last month
BryceZhuo / HybridNorm
View on GitHub
The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
☆19Mar 7, 2025Updated last year
JinaLeejnl / 2D-TPE
View on GitHub
2D-TPE: Two-Dimensional Positional Encoding Enhances Table Understanding for Large Language Models (WWW 2025)
☆10Apr 15, 2025Updated last year
analokmaus / kaggle-aimo2-fast-math-r1
View on GitHub
Kaggle AIMO2 solution with token-efficient reasoning LLM recipes
☆50Aug 7, 2025Updated 11 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
kyutai-labs / ARC-Encoder
View on GitHub
☆30Jan 5, 2026Updated 6 months ago
ZunhaiSu / Super-Experts-Profilling
View on GitHub
(ICLR 2026) Unveiling Super Experts in Mixture-of-Experts Large Language Models
☆43Sep 25, 2025Updated 9 months ago
a-little-hoof / Uni_Instruct
View on GitHub
(NeurIPS 2025) Uni-Instruct: One-step Diffusion Model through Unified Diffusion Divergence Instruction
☆19Nov 2, 2025Updated 8 months ago
Leey21 / CipherBank
View on GitHub
☆13Jun 13, 2025Updated last year
OpenMOSS / Lorsa
View on GitHub
☆30Nov 9, 2025Updated 8 months ago
RadicalNumerics / spear
View on GitHub
Structured Primitives for Efficient Architecture Research
☆20Dec 22, 2025Updated 6 months ago
thu-coai / AutoDetect
View on GitHub
Official github repo for AutoDetect, an automated weakness detection framework for LLMs.
☆46Jun 25, 2024Updated 2 years ago