tianyi-lab/MiP-Overthinking

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tianyi-lab/MiP-Overthinking)

tianyi-lab / MiP-Overthinking

[COLM'25] Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?

☆39

Alternatives and similar repositories for MiP-Overthinking

Users that are interested in MiP-Overthinking are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MingLiiii / Gradient_Unified
View on GitHub
How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients
☆20Jun 17, 2025Updated last year
tianyi-lab / Moltbook_Socialization
View on GitHub
Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook
☆18Feb 17, 2026Updated 5 months ago
tianyi-lab / Mosaic-IT
View on GitHub
[ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning
☆20Sep 27, 2025Updated 10 months ago
zxiangx / LC-R1
View on GitHub
Code for paper: Optimizing Length Compression in Large Reasoning Models
☆29Oct 20, 2025Updated 9 months ago
tianyi-lab / C3PO
View on GitHub
[COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"
☆21Apr 9, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
tianyi-lab / RuleR
View on GitHub
[NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling
☆14Sep 27, 2025Updated 10 months ago
tianyi-lab / DEBATunE
View on GitHub
[ACL'24] Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements
☆25Sep 14, 2024Updated last year
tianyi-lab / ColorBench
View on GitHub
[NeurIPS'25] ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and R…
☆40Sep 27, 2025Updated 10 months ago
elated-sawyer / WALL-E
View on GitHub
Official code for the paper: WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents
☆63Dec 3, 2025Updated 7 months ago
QizhiPei / MathFusion
View on GitHub
MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025)
☆37Jul 16, 2025Updated last year
MasterVito / SwS
View on GitHub
Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning
☆42Nov 11, 2025Updated 8 months ago
tianyi-lab / R2-T2
View on GitHub
[ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"
☆19Mar 10, 2025Updated last year
stallone0000 / Reasoning-Skill
View on GitHub
☆20May 25, 2026Updated 2 months ago
Lichang-Chen / claude2-alpaca
View on GitHub
First instruction-tuning dataset distilled from Claude2 (52k Alpaca prompts)!
☆13Oct 22, 2023Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
sunblaze-ucb / reasoning_ladder
View on GitHub
☆35May 16, 2025Updated last year
measure-infinity / mulan-code
View on GitHub
☆43Jul 16, 2024Updated 2 years ago
tianyi-lab / DisCL
View on GitHub
[ICCV 2025] Diffusion Curriculum (DisCL)
☆18Sep 26, 2025Updated 10 months ago
TEAM-ARM / arm
View on GitHub
[NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model
☆68Apr 6, 2026Updated 3 months ago
kai-wen-yang / IDAA
View on GitHub
[ICML2022] "Identity-Disentangled Adversarial Augmentation for Self-Supervised Learning"
☆10Jul 24, 2022Updated 4 years ago
open-compass / RePro
View on GitHub
[ICLR 2026] Rectifying LLM Thought From Lens of Optimization
☆15Dec 5, 2025Updated 7 months ago
Yibin-Lei / MetaEOL
View on GitHub
Implementation for ACL 2024 paper "Meta-Task Prompting Elicits Embeddings from Large Language Models"
☆12Jul 25, 2024Updated 2 years ago
uservan / speculative_thinking
View on GitHub
☆34Oct 13, 2025Updated 9 months ago
JierunChen / SFT-RL-SynergyDilemma
View on GitHub
☆15Jan 14, 2026Updated 6 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
linkedin / ControlLLM
View on GitHub
Control LLM
☆23Apr 6, 2025Updated last year
LivingFutureLab / DeltaBench
View on GitHub
☆46Mar 4, 2025Updated last year
VidCapBench / VidCapBench
View on GitHub
☆13May 17, 2025Updated last year
Leey21 / CipherBank
View on GitHub
☆14Jun 13, 2025Updated last year
limenlp / verl
View on GitHub
AdaRFT: Efficient Reinforcement Finetuning via Adaptive Curriculum Learning
☆56Jun 13, 2025Updated last year
beichenzbc / BoostStep
View on GitHub
official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"
☆37Jan 21, 2025Updated last year
alibaba-damo-academy / VL-Cogito
View on GitHub
☆24Nov 4, 2025Updated 8 months ago
IAAR-Shanghai / xVerify
View on GitHub
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations
☆148Nov 13, 2025Updated 8 months ago
sail-sg / understand-r1-zero
View on GitHub
Understanding R1-Zero-Like Training: A Critical Perspective
☆1,268Aug 27, 2025Updated 11 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
THU-KEG / PairJudgeRM
View on GitHub
☆15Apr 14, 2025Updated last year
tianyi-lab / MoE-Embedding
View on GitHub
[ICLR 2025 Oral] "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"
☆92Oct 15, 2024Updated last year
open-compass / CompassVerifier
View on GitHub
[EMNLP 2025] CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward
☆68Aug 10, 2025Updated 11 months ago
LAMDA-NeSy / Self-Backtracking
View on GitHub
☆52Feb 12, 2025Updated last year
vmicheli / lm-butlers
View on GitHub
☆12Aug 30, 2021Updated 4 years ago
LaVi-Lab / LongContextReasoner
View on GitHub
[ACL 2024] Making Long-Context Language Models Better Multi-Hop Reasoners
☆20May 28, 2024Updated 2 years ago
Ayame1006 / LLMtoGraph
View on GitHub
☆10Aug 24, 2023Updated 2 years ago