☆179Dec 5, 2025Updated 2 months ago
Alternatives and similar repositories for AdaptThink
Users that are interested in AdaptThink are comparing it to the libraries listed below
Sorting:
- ☆21Feb 22, 2026Updated last week
- [ICLR'25] Code for KaSA, an official implementation of "KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models"☆20Jan 16, 2025Updated last year
- [NeurIPS 2025] Thinkless: LLM Learns When to Think☆253Sep 26, 2025Updated 5 months ago
- The official repository of paper "Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models''☆110Aug 15, 2025Updated 6 months ago
- ☆34Jan 25, 2026Updated last month
- ☆21Jul 21, 2025Updated 7 months ago
- ☆33Nov 18, 2025Updated 3 months ago
- ☆145Sep 12, 2025Updated 5 months ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆53Sep 29, 2025Updated 5 months ago
- ☆60Jan 12, 2026Updated last month
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆18Oct 17, 2025Updated 4 months ago
- Official implementation of "Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought" (NeurIPS 2025)☆38Oct 8, 2025Updated 4 months ago
- [NeurIPS 2025] The implementation of paper "On Reasoning Strength Planning in Large Reasoning Models"☆30Jul 6, 2025Updated 7 months ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 6 months ago
- ☆58Jun 7, 2025Updated 8 months ago
- ☆46Sep 27, 2025Updated 5 months ago
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆30Jul 17, 2024Updated last year
- Code for Heima☆59Apr 21, 2025Updated 10 months ago
- Official Code for "Learning to Reason via Mixture-of-Thought for Logical Reasoning"☆26Nov 20, 2025Updated 3 months ago
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…☆80May 30, 2025Updated 9 months ago
- Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning☆28Jul 14, 2025Updated 7 months ago
- A Framework for Evaluating AI Agent Safety in Realistic Environments☆30Oct 2, 2025Updated 5 months ago
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆23Nov 13, 2025Updated 3 months ago
- DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding☆66Jun 10, 2025Updated 8 months ago
- ☆17Aug 1, 2025Updated 7 months ago
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style☆76Jul 18, 2025Updated 7 months ago
- [ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)☆892Jan 28, 2026Updated last month
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆262May 14, 2025Updated 9 months ago
- MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning☆770Sep 7, 2025Updated 5 months ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning☆97Feb 21, 2025Updated last year
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated last year
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆28Dec 30, 2025Updated 2 months ago
- [COLM2025] "Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors"☆55Oct 6, 2025Updated 4 months ago
- ☆14Mar 20, 2025Updated 11 months ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 6 months ago
- Work in progress.☆79Nov 25, 2025Updated 3 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆159Jun 26, 2025Updated 8 months ago
- ☆47Nov 8, 2024Updated last year
- ☆29Nov 9, 2025Updated 3 months ago