LightChen233/reasoning-boundary

LightChen233 / reasoning-boundary

☆71

Alternatives and similar repositories for reasoning-boundary

Users who are interested in reasoning-boundary are comparing it to the libraries listed below.

princeton-nlp / continual-factoid-memorization
Continual Memorization of Factoids in Large Language Models
☆12 · Nov 20, 2024 · Updated last year
princeton-pli / what-makes-good-rm
[NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective
☆44 · Sep 18, 2025 · Updated 10 months ago
LightChen233 / M3CoT
☆92 · Mar 12, 2026 · Updated 4 months ago
Unakar / Efficient_AI
This project contains my personal answers to the MIT 6.5940 course assignments, along with study notes and reflections.
☆15 · Mar 1, 2024 · Updated 2 years ago
edenbiran / HoppingTooLate
Exploring the Limitations of Large Language Models on Multi-Hop Queries
☆33 · Mar 2, 2025 · Updated last year
kokolerk / TON
[NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models
☆58 · Sep 29, 2025 · Updated 9 months ago
sunblaze-ucb / omega
☆47 · Jun 24, 2025 · Updated last year
MasterVito / SvS
Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training
☆54 · Dec 13, 2025 · Updated 7 months ago
AkideLiu / MiniCache
☆14 · Sep 7, 2024 · Updated last year
yifeiwang77 / Self-Correction
☆20 · Nov 3, 2024 · Updated last year
sail-sg / CPO
[NeurIPS 2024] The official implementation of the paper "Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs".
☆137 · Mar 21, 2025 · Updated last year
GAIR-NLP / OlympicArena
[NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
☆106 · Mar 6, 2025 · Updated last year
RLHFlow / GVM
☆16 · Jul 29, 2025 · Updated 11 months ago
liunian-Jay / AgenticRAG-RL
A minimal implementation of Agentic RAG using GRPO
☆17 · Jun 11, 2025 · Updated last year
da03 / Internalize_CoT_Step_by_Step
☆209 · Apr 19, 2025 · Updated last year
haolunc / iGSM-Replication-physics-LLM
This repository contains the replication of the iGSM dataset generation process from the Physics of LLM paper by Zeyuan Zhu.
☆17 · Sep 13, 2024 · Updated last year
MasterVito / SwS
Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning
☆42 · Nov 11, 2025 · Updated 8 months ago
whunextgen / LLMindCraft
Shaping Language Models with Cognitive Insights
☆15 · Feb 29, 2024 · Updated 2 years ago
SparkJiao / dpo-trajectory-reasoning
[EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".
☆84 · Jan 14, 2025 · Updated last year
shenao-zhang / BARL
Bayes-Adaptive RL for LLM Reasoning
☆45 · May 28, 2025 · Updated last year
Alsace08 / OOD-Math-Reasoning
[NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"
☆28 · May 28, 2024 · Updated 2 years ago
hulianyuyy / iLLaVA
iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models (ICLR 2026)
☆23 · Jun 24, 2026 · Updated last month
shivamag125 / EM_PT
☆33 · Aug 21, 2025 · Updated 11 months ago
GeniusHTX / TALE
☆151 · Sep 12, 2025 · Updated 10 months ago
GuanghaoYe / Emergence-of-Thinking
☆55 · Feb 11, 2025 · Updated last year
iiis-ai / IterativeQuestionComposing
[AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003)
☆23 · Oct 2, 2025 · Updated 9 months ago
wizard-III / Archer2.0
Archer2.0 evolves from its predecessor by introducing ASPO, which overcomes fundamental PPO-Clip limitations to prevent premature converg…
☆31 · Oct 10, 2025 · Updated 9 months ago
song-wx / SIFT
[ICML 2024 Spotlight] Fine-Tuning Pre-trained Large Language Models Sparsely
☆24 · Jun 26, 2024 · Updated 2 years ago
Eclipsess / Awesome-Efficient-Reasoning-LLMs
[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
☆784 · Feb 28, 2026 · Updated 4 months ago
Zayne-sprague / MuSR
☆57 · Aug 10, 2024 · Updated last year
yliu-cs / PiTe
[ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model
☆17 · Feb 13, 2025 · Updated last year
chengtan9907 / mc-cot
The official implementation of the ECCV'24 paper MC-CoT: Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models w…
☆26 · May 19, 2024 · Updated 2 years ago
StarDewXXX / O1-Pruner
Official repository for the paper "O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning"
☆100 · Feb 21, 2025 · Updated last year
RUCAIBox / ELMER
This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…
☆26 · Oct 27, 2022 · Updated 3 years ago
StevenZHB / CoT_Causal_Analysis
Repository of the paper "How Likely Do LLMs with CoT Mimic Human Reasoning?"
☆23 · Feb 19, 2025 · Updated last year
GeorgeVern / lmcor
Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs"
☆12 · Apr 20, 2024 · Updated 2 years ago
LightChen233 / Awesome-Long-Chain-of-Thought-Reasoning
Latest Advances on Long Chain-of-Thought Reasoning
☆647 · Jul 18, 2025 · Updated last year
pzs19 / LEMMA
☆16 · Sep 4, 2025 · Updated 10 months ago
genrm-star / genrm-critiques
GenRM-CoT: Data release for verification rationales
☆68 · Oct 16, 2024 · Updated last year