[COLM'25] Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?
☆37 · Jun 5, 2025 · Updated 9 months ago
Alternatives and similar repositories for MiP-Overthinking
Users who are interested in MiP-Overthinking compare it to the repositories listed below
- [ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning ☆20 · Sep 27, 2025 · Updated 5 months ago
- [NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling ☆14 · Sep 27, 2025 · Updated 5 months ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit. ☆16 · Jun 28, 2024 · Updated last year
- Cost-Sensitive Toolpath Agent for Multi-turn Image Editing ☆26 · Mar 26, 2025 · Updated 11 months ago
- [ICLR 2026] Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing ☆29 · Feb 6, 2026 · Updated last month
- ☆20 · Oct 10, 2025 · Updated 4 months ago
- ☆32 · Oct 13, 2025 · Updated 4 months ago
- [ACL'24] Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements ☆24 · Sep 14, 2024 · Updated last year
- ☆25 · Nov 19, 2025 · Updated 3 months ago
- [NeurIPS 2025] Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning" ☆30 · Oct 20, 2025 · Updated 4 months ago
- A comprehensive React Native starter template built with Expo. It includes reusable UI components, Poppins font setup, NativeWind, Fireba… ☆23 · Updated this week
- MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025) ☆35 · Jul 16, 2025 · Updated 7 months ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations ☆144 · Nov 13, 2025 · Updated 3 months ago
- AuraMatrix is a personality analysis website that uses an LLM for evaluation. I made this for Gyanotsav-2025 to show different ways to ut… ☆11 · Dec 22, 2025 · Updated 2 months ago
- Structured TRIZ prompt engineering for LLMs in an open, portable XML format – MIT licensed. ☆16 · Nov 11, 2025 · Updated 3 months ago
- ☆35 · May 16, 2025 · Updated 9 months ago
- A Sober Look at Language Model Reasoning ☆93 · Nov 18, 2025 · Updated 3 months ago
- VibEx (vx) is a developer-friendly CLI tool that streamlines the process of working with AI coding assistants. It helps developers prepar… ☆28 · May 17, 2025 · Updated 9 months ago
- MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces ☆10 · Mar 24, 2025 · Updated 11 months ago
- Glitch Gremlin AI ☆15 · Apr 5, 2025 · Updated 11 months ago
- CoachLint is your AI coding coach. It guides you through errors instead of just solving them for you. ☆23 · Nov 20, 2025 · Updated 3 months ago
- Official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning" ☆37 · Jan 21, 2025 · Updated last year
- ☆46 · Mar 4, 2025 · Updated last year
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models ☆53 · Sep 29, 2025 · Updated 5 months ago
- Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning ☆25 · Jan 5, 2026 · Updated 2 months ago
- A powerful AI prompt engineering tool that transforms simple instructions into detailed, context-rich prompts using Google's Gemini Pro t… ☆15 · Aug 28, 2025 · Updated 6 months ago
- AutonomousSphere is an agentic collaboration server. Agents talk, act, and use tools like teammates. Federated servers form an internet o… ☆16 · May 13, 2025 · Updated 9 months ago
- Shakey OS Mobile AI Framework for React Native allowing people to build React Native apps for iOS and Android with AI tooling and wallet … ☆28 · Feb 3, 2025 · Updated last year
- "Open-source toolkit (Python Library, Registry API, CLI) for secure, decentralized AI agent interoperability using A2A/MCP." ☆14 · May 10, 2025 · Updated 9 months ago
- 📱 A template for your next React Native project: Expo, TypeScript, ReStyle, Husky, react-navigation, react-query, react-hook-form, zusta… ☆16 · Dec 15, 2025 · Updated 2 months ago
- 💀 gigasmol: a lightweight wrapper for gigachat api model for seamless use with smolagents. ☆15 · Oct 23, 2025 · Updated 4 months ago
- AI Tasks. An LLM-integrated agent orchestration tool for Liferay. ☆14 · May 16, 2025 · Updated 9 months ago
- IBM watsonx Code Assistant for Red Hat Ansible Lightspeed demystifies the process of Ansible Playbook creation through generative AI-powe… ☆19 · Sep 18, 2025 · Updated 5 months ago
- Emphasizes AI-based projects for various companies. ☆15 · Apr 1, 2025 · Updated 11 months ago
- ☆13 · Aug 12, 2022 · Updated 3 years ago
- [NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models" ☆28 · Sep 18, 2025 · Updated 5 months ago
- SYSTEM PROMPT TRANSPARENCY FOR ALL ☆12 · May 22, 2025 · Updated 9 months ago
- Hierarchical Attention Network based Explainable Knowledge Tracing ☆10 · May 18, 2022 · Updated 3 years ago
- ☆14 · Apr 4, 2025 · Updated 11 months ago