AutoThink is a reinforcement learning framework designed to equip R1-style language models with adaptive reasoning capabilities. Instead of always thinking or never thinking, the model learns when to engage in explicit reasoning, balancing performance and efficiency.
☆51 · Oct 14, 2025 · Updated 7 months ago
Alternatives and similar repositories for AutoThink
Users who are interested in AutoThink are comparing it to the libraries listed below.
- Code for SIGIR-2021 full paper: Initiative-Aware Self-Supervised Learning for Knowledge-Grounded Conversations ☆11 · Aug 3, 2021 · Updated 4 years ago
- [NeurIPS 2025 D&B Track] MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research ☆29 · May 8, 2026 · Updated 2 weeks ago
- The official repository of paper "Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models" ☆114 · Aug 15, 2025 · Updated 9 months ago
- GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents ☆22 · Feb 26, 2026 · Updated 3 months ago
- ☆32 · Jun 24, 2024 · Updated last year
- CrysText: A Generative AI Approach for Text-Conditioned Crystal Structure Generation using LLM ☆18 · Nov 3, 2025 · Updated 6 months ago
- P4Control: Line-Rate Cross-Host Attack Prevention via In-Network Information Flow Control Enabled by Programmable Switches and eBPF ☆11 · May 20, 2024 · Updated 2 years ago
- IPO: Interpretable Prompt Optimization for Vision-Language Models (NeurIPS 2024) ☆15 · Mar 4, 2025 · Updated last year
- Mainly aimed at scalable subspace clustering ☆11 · Jun 2, 2017 · Updated 8 years ago
- [NeurIPS 2025] Thinkless: LLM Learns When to Think ☆257 · Sep 26, 2025 · Updated 8 months ago
- Multimodal Federated Learning on IoT Data ☆11 · Dec 17, 2023 · Updated 2 years ago
- Uses TensorFlow 2 to implement PPO for playing Atari games ☆13 · Oct 25, 2019 · Updated 6 years ago
- Self-Teaching Notes on Gradient Leakage Attacks against GPT-2 models. ☆14 · Mar 18, 2024 · Updated 2 years ago
- ☆12 · Dec 13, 2023 · Updated 2 years ago
- The official implementation of paper "Overcoming Data and Model heterogeneities in Decentralized Federated Learning via Synthetic Anchors… ☆15 · Jun 14, 2024 · Updated last year
- Momentum Contrast for Unsupervised Visual Representation Learning ☆16 · Mar 24, 2023 · Updated 3 years ago
- [ECMLPKDD 2020] "Topological Insights into Sparse Neural Networks" ☆13 · May 2, 2022 · Updated 4 years ago
- Self-Supervised Dataset Distillation for Transfer Learning ☆18 · Apr 10, 2024 · Updated 2 years ago
- Deep Co-Clustering (SDM'19) ☆15 · Dec 18, 2021 · Updated 4 years ago
- Minimal code for A Generalist Agent ☆44 · Nov 4, 2022 · Updated 3 years ago
- [AAAI 2026] WorldRFT: Latent World Model Planning with Reinforcement Fine-Tuning for Autonomous Driving ☆37 · Dec 23, 2025 · Updated 5 months ago
- ☆14 · Dec 1, 2025 · Updated 5 months ago
- The official code and data for paper "VidEgoThink: Assessing Egocentric Video Understanding Capabilities for Embodied AI" ☆18 · Mar 25, 2025 · Updated last year
- ☆19 · Feb 25, 2023 · Updated 3 years ago
- [ICLR 2025] Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination ☆21 · Jan 27, 2025 · Updated last year
- [ACM SoCC'22] Pisces: Efficient Federated Learning via Guided Asynchronous Training ☆13 · Apr 28, 2025 · Updated last year
- 星搭 low-code AI assistant plugin that uses StableDiffusion and ChatGPT to generate illustrations and copy ☆11 · Mar 22, 2023 · Updated 3 years ago
- [IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering ☆20 · Jul 6, 2023 · Updated 2 years ago
- ☆16 · Nov 26, 2024 · Updated last year
- ☆51 · Mar 20, 2026 · Updated 2 months ago
- DiffCSP: Finding Browser Bugs in Content Security Policy Enforcement through Differential Testing ☆17 · Feb 27, 2023 · Updated 3 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space… ☆24 · Nov 29, 2018 · Updated 7 years ago
- ☆12 · Jul 31, 2025 · Updated 9 months ago
- ☆24 · Jul 16, 2024 · Updated last year
- ☆31 · Jun 5, 2025 · Updated 11 months ago
- ☆18 · Dec 20, 2023 · Updated 2 years ago
- ☆12 · Jul 24, 2024 · Updated last year
- ☆15 · Nov 21, 2022 · Updated 3 years ago
- A Unified and General Framework for Continual Learning, ICLR 2024 ☆15 · Mar 22, 2024 · Updated 2 years ago