AngelaZZZ-611/reasoning_models_probing

AngelaZZZ-611 / reasoning_models_probing

☆22

Alternatives and similar repositories for reasoning_models_probing

Users who are interested in reasoning_models_probing are comparing it to the libraries listed below.

Trae1ounG / Pretrain_Space_RLVR
[arxiv: 2604.14142] From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space
☆17 · Apr 16, 2026 · Updated 3 months ago
UMass-Embodied-AGI / BudgetGuidance
[ACL'26 Findings] Steering LLM Thinking with Budget Guidance
☆33 · Feb 19, 2026 · Updated 5 months ago
staymylove / COT_Compresstion_via_Step_entropy
☆29 · Aug 8, 2025 · Updated 11 months ago
KempnerInstitute / llm_uncertainty
Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"
☆11 · Jul 18, 2026 · Updated last week
K1nght / RAIN-Merging
[ICLR 2026 Oral] RAIN-Merging
☆15 · Mar 9, 2026 · Updated 4 months ago
YuanheZ / DAG-MATH
[ICLR2026] DAG-Math: Graph-of-Thought Guided Mathematical Reasoning in LLMs
☆23 · Oct 19, 2025 · Updated 9 months ago
yihuaihong / Dissecting-FT-Unlearning
[EMNLP 2024 Main] Code for the paper "Dissecting Fine-Tuning Unlearning in Large Language Models"
☆14 · Oct 10, 2024 · Updated last year
PreckLi / MIP-Editor
Official implementation of Cross-Modal Unlearning via Influential Neuron Path Editing in Multimodal Large Language Models
☆16 · Mar 21, 2026 · Updated 4 months ago
MattYoon / reasoning-models-confidence
[NeurIPS 2025] "Reasoning Models Better Express Their Confidence"
☆23 · Nov 19, 2025 · Updated 8 months ago
QwenLM / PolyMath
[NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"
☆43 · May 22, 2025 · Updated last year
chicosirius / think-or-not
☆22 · May 23, 2025 · Updated last year
Interplay-LM-Reasoning / Interplay-LM-Reasoning
[ICML 2026 Spotlight] On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models
☆164 · Jun 8, 2026 · Updated last month
wutaiqiang / MI
Official code for paper "Revisiting Model Interpolation for Efficient Reasoning"
☆17 · Jul 14, 2026 · Updated 2 weeks ago
lingchen0331 / UQ_ICL
Uncertainty quantification for in-context learning of large language models
☆15 · Apr 1, 2024 · Updated 2 years ago
kaistAI / Knowledge-Entropy
[ICLR 2025 Oral] Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition
☆17 · Nov 25, 2024 · Updated last year
beanie00 / self-distillation-analysis
Codebase for the work “Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?”
☆75 · Apr 14, 2026 · Updated 3 months ago
kaistAI / knowledge-reasoning
[EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…
☆23 · Dec 4, 2024 · Updated last year
Zanette-Labs / speed-rl
☆18 · Feb 2, 2026 · Updated 5 months ago
GAIR-NLP / OctoThinker
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆189 · Jul 23, 2025 · Updated last year
shivamag125 / EM_PT
☆33 · Aug 21, 2025 · Updated 11 months ago
zz-haooo / STEER
The implementation of the ACL 2026 paper "Rethinking Entropy Interventions in RLVR: An Entropy Change Perspective"
☆26 · Jul 19, 2026 · Updated last week
ASTRAL-Group / LoRe
When Reasoning Meets Its Laws
☆38 · Jan 2, 2026 · Updated 6 months ago
lasgroup / user_interactions
Aligning Language Models from User Interactions via Self-Distillation
☆26 · Mar 31, 2026 · Updated 3 months ago
launchnlp / LitCab
☆25 · Jun 10, 2025 · Updated last year
taoszhang / MMhops-R1
MMhops-R1: Multimodal Multi-hop Reasoning
☆16 · Feb 28, 2026 · Updated 5 months ago
cvenhoff / thinking-llms-interp
☆25 · Jul 8, 2026 · Updated 3 weeks ago
sbi-benchmark / diffeqtorch
DifferentialEquations.jl with PyTorch
☆11 · Oct 12, 2022 · Updated 3 years ago
backprop07 / Self-Certainty
Implementation of self-certainty as an extension of the ZeroEval project
☆38 · May 31, 2025 · Updated last year
chentong0 / rl-binary-rar
Official repo for "Binary Retrieval-augmented Reward Mitigates Hallucinations"
☆15 · Nov 13, 2025 · Updated 8 months ago
THU-KEG / DICE
DICE: Detecting In-distribution Data Contamination with LLM's Internal State
☆12 · Sep 21, 2024 · Updated last year
MiaoXiong2320 / llm-uncertainty
Code repo for the ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"
☆148 · Mar 14, 2024 · Updated 2 years ago
Trae1ounG / BuPO
[arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies
☆60 · Feb 6, 2026 · Updated 5 months ago
AlphaLab-USTC / LRM-plans-CoT
[NeurIPS 2025] The implementation of paper "On Reasoning Strength Planning in Large Reasoning Models"
☆31 · Jul 6, 2025 · Updated last year
MikaStars39 / FeatureAlignment
FeatureAlignment = Alignment + Mechanistic Interpretability
☆35 · Mar 8, 2025 · Updated last year
SWE-bench / reading-list
Academic papers and works related to SWE-bench and SWE-agents
☆15 · Dec 8, 2025 · Updated 7 months ago
Hesse73 / RLVR-Directions
Source Code for our ICLR'26 paper
☆17 · Feb 22, 2026 · Updated 5 months ago
interactivebench / InteractiveBench
Official Project Page for Interactive Benchmarks
☆31 · May 12, 2026 · Updated 2 months ago
RUCAIBox / FIGA
[ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"
☆10 · May 5, 2024 · Updated 2 years ago
OATML / semantic-entropy-probes
☆65 · Jul 12, 2026 · Updated 2 weeks ago