☆182Dec 5, 2025Updated 3 months ago
Alternatives and similar repositories for AdaptThink
Users that are interested in AdaptThink are comparing it to the libraries listed below.
- AutoThink is a reinforcement learning framework designed to equip R1-style language models with adaptive reasoning capabilities. Instead …☆50Oct 14, 2025Updated 5 months ago
- The official repository of the paper "Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models"☆114Aug 15, 2025Updated 7 months ago
- ☆21Updated this week
- [NeurIPS 2025] Thinkless: LLM Learns When to Think☆254Sep 26, 2025Updated 5 months ago
- [NeurIPS 2025] The implementation of paper "On Reasoning Strength Planning in Large Reasoning Models"☆32Jul 6, 2025Updated 8 months ago
- ☆34Nov 18, 2025Updated 4 months ago
- ☆60Jun 7, 2025Updated 9 months ago
- ☆146Sep 12, 2025Updated 6 months ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning☆98Feb 21, 2025Updated last year
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated last year
- Code for Heima☆59Apr 21, 2025Updated 11 months ago
- Official Repo of Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents☆68Oct 28, 2025Updated 4 months ago
- [ICLR2026] Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping☆63May 22, 2025Updated 10 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆262May 14, 2025Updated 10 months ago
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style☆78Jul 18, 2025Updated 8 months ago
- Codes for paper SoAy: A Service-oriented APIs Applying Framework of Large Language Models☆27Jul 14, 2025Updated 8 months ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆54Sep 29, 2025Updated 5 months ago
- [ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)☆916Jan 28, 2026Updated last month
- [ICLR'25] Code for KaSA, an official implementation of "KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models"☆20Jan 16, 2025Updated last year
- ☆27Jul 18, 2025Updated 8 months ago
- ☆62Oct 29, 2024Updated last year
- Official implementation of "Reasoning by Superposition: A Theoretical Perspective on Chain of Continuous Thought" (NeurIPS 2025)☆38Oct 8, 2025Updated 5 months ago
- Code for paper: Optimizing Length Compression in Large Reasoning Models☆29Oct 20, 2025Updated 5 months ago
- [KDD 2025] AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning☆15May 27, 2025Updated 9 months ago
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆65Sep 27, 2025Updated 5 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆260Mar 7, 2026Updated 2 weeks ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆102Dec 24, 2024Updated last year
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond☆349Jan 22, 2026Updated 2 months ago
- [ICML 2025] TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation☆123May 19, 2025Updated 10 months ago
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model☆65Mar 10, 2026Updated 2 weeks ago
- The official repository of our paper "Reinforcing Video Reasoning with Focused Thinking"☆35Jun 12, 2025Updated 9 months ago
- MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning☆773Sep 7, 2025Updated 6 months ago
- ☆35Jan 25, 2026Updated last month
- Work in progress.☆79Nov 25, 2025Updated 3 months ago
- [NIPS 2025 DB Spotlight] AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios☆29Dec 1, 2025Updated 3 months ago
- ☆30May 22, 2024Updated last year
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs, and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…☆83May 30, 2025Updated 9 months ago
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆20Jan 24, 2025Updated last year
- DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding☆66Jun 10, 2025Updated 9 months ago