sunblaze-ucb/reasoning_ladder

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sunblaze-ucb/reasoning_ladder)

sunblaze-ucb / reasoning_ladder

☆35

Alternatives and similar repositories for reasoning_ladder

Users that are interested in reasoning_ladder are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MingLiiii / Gradient_Unified
View on GitHub
How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients
☆20Jun 17, 2025Updated last year
google-research-datasets / recognizing-multimodal-entailment
View on GitHub
The dataset consists of public social media url pairs and the corresponding entailment label for an external conference (ACL 2021). Each …
☆14Aug 16, 2021Updated 4 years ago
janhq / space-thinker
View on GitHub
☆21Mar 25, 2025Updated last year
LHL3341 / MetaLadder
View on GitHub
MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)
☆12Apr 18, 2025Updated last year
tangzhy / RealCritic
View on GitHub
☆15Jan 27, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
THUKElab / LatEval
View on GitHub
☆10Mar 19, 2024Updated 2 years ago
hamishivi / automated-instruction-selection
View on GitHub
Exploration of automated dataset selection approaches at large scales.
☆55Mar 4, 2025Updated last year
Leey21 / CipherBank
View on GitHub
☆14Jun 13, 2025Updated last year
YJiangcm / BMC
View on GitHub
[ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization
☆12Jan 26, 2025Updated last year
NVIDIA / NeMo-Inspector
View on GitHub
A tool for an analysis of LLM generations.
☆42Oct 13, 2025Updated 9 months ago
tianyi-lab / MiP-Overthinking
View on GitHub
[COLM'25] Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?
☆39Jun 5, 2025Updated last year
EIT-NLP / Distilling-CoT-Reasoning
View on GitHub
[ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".
☆22Feb 26, 2025Updated last year
microsoft / x-reasoner
View on GitHub
X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains
☆49Feb 4, 2026Updated 5 months ago
RM-R1-UIUC / RM-R1
View on GitHub
[ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models
☆167Jun 26, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
NineAbyss / S2R
View on GitHub
This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"
☆77Apr 22, 2025Updated last year
Yikai-Liao / efficient_bpe
View on GitHub
An Efficent BPE Algorithm Faster then Hugging Face Tokenizer's Implementation
☆13Sep 9, 2024Updated last year
nishadsinghi / sc-genrm-scaling
View on GitHub
[COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…
☆15Oct 31, 2025Updated 8 months ago
uservan / ThinkPO
View on GitHub
☆17Aug 1, 2025Updated 11 months ago
The-Inscrutable-X / TACQ
View on GitHub
Official Repository for Task-Circuit Quantization
☆28Jun 1, 2025Updated last year
cheryyunl / Make-An-Agent
View on GitHub
☆51Jul 22, 2024Updated 2 years ago
aeroplanepaper / GRPO-LEAD
View on GitHub
☆40Nov 18, 2025Updated 8 months ago
HypherX / Evolution-Analysis
View on GitHub
☆25Dec 13, 2024Updated last year
youngjoey-ai / tracerag
View on GitHub
一个强调工程化、可观测、可测试、可扩展的 RAG 项目。TraceRAG 的目标不是只把答案“生成出来”，而是把文档导入、切块、向量化、检索、带来源回答、评估与后续 tracing 拆成可独立验证的阶段，逐步演进成一个可维护、可解释、可复盘的生产级 RAG。
☆15Apr 2, 2026Updated 3 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
Victorwz / LaViA
View on GitHub
☆10Jul 13, 2024Updated 2 years ago
pzs19 / LEMMA
View on GitHub
☆16Sep 4, 2025Updated 10 months ago
TIGER-AI-Lab / AceCoder
View on GitHub
The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]
☆100Apr 9, 2025Updated last year
JiaQiSJTU / FaithEval-FFLM
View on GitHub
A zero-shot faithfulness evaluation metric for text summarization
☆11Oct 17, 2023Updated 2 years ago
psunlpgroup / ReaLMistake
View on GitHub
This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".
☆32Aug 18, 2024Updated last year
bosmkamdi / BSCFLASHBOT
View on GitHub
Create and Deploy a Front Run Bot Sol Contract on BSC FLASHBOT
☆12Mar 19, 2022Updated 4 years ago
siyan-zhao / decision-stacks
View on GitHub
Implementation of Decision Stacks: Flexible RL via Modular Generative Models [NeurIPS 2023]
☆12Jun 27, 2023Updated 3 years ago
OpenMOSS / Lorsa
View on GitHub
☆30Nov 9, 2025Updated 8 months ago
beichenzbc / BoostStep
View on GitHub
official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"
☆37Jan 21, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
UKPLab / cdcr-beyond-corpus-tailored
View on GitHub
📄🕸️ Generalizing Cross-Document Event Coreference Resolution Across Multiple Corpora
☆10May 25, 2022Updated 4 years ago
TIGER-AI-Lab / General-Reasoner
View on GitHub
General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]
☆229Nov 27, 2025Updated 7 months ago
SynthLabsAI / big-math
View on GitHub
A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models
☆74Feb 25, 2025Updated last year
rosieyzh / openrlhf-pretrain
View on GitHub
Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"
☆29Oct 14, 2025Updated 9 months ago
Zcchill / Value-Residual-Learning
View on GitHub
☆15Mar 20, 2025Updated last year
facebookresearch / ReasonIR
View on GitHub
Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".
☆230Jul 2, 2026Updated 3 weeks ago
abbyvansoest / maxent
View on GitHub
☆14May 30, 2019Updated 7 years ago