ScienceOne-AI/AutoThink

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ScienceOne-AI/AutoThink)

ScienceOne-AI / AutoThink

AutoThink is a reinforcement learning framework designed to equip R1-style language models with adaptive reasoning capabilities. Instead of always thinking or never thinking, the model learns when to engage in explicit reasoning, balancing performance and efficiency.

☆52

Alternatives and similar repositories for AutoThink

Users that are interested in AutoThink are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

THU-KEG / AdaptThink
View on GitHub
☆187Dec 5, 2025Updated 7 months ago
TU2021 / DPO-VP
View on GitHub
Improving Math reasoning through Direct Preference Optimization with Verifiable Pairs
☆21Mar 20, 2025Updated last year
yannqi / R-4B
View on GitHub
The official repository of "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration"
☆141Sep 4, 2025Updated 10 months ago
GLJS / AudioToolAgent
View on GitHub
GitHub repository for AudioToolAgent
☆20Feb 13, 2026Updated 5 months ago
staymylove / COT_Compresstion_via_Step_entropy
View on GitHub
☆29Aug 8, 2025Updated 11 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
XuankunRong / SafeGRPO
View on GitHub
[CVPR'26] SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization
☆21Feb 19, 2026Updated 5 months ago
MetaCopilot / dseval
View on GitHub
☆33Jun 24, 2024Updated 2 years ago
VainF / Thinkless
View on GitHub
[NeurIPS 2025] Thinkless: LLM Learns When to Think
☆261Sep 26, 2025Updated 10 months ago
machine981 / SCOPE
View on GitHub
SCOPE: Signal-Calibrated On-Policy Distillation Enhancement with Dual-Path Adaptive Weighting
☆28Jun 22, 2026Updated last month
iie-ycx / DEER
View on GitHub
This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.
☆199Jul 7, 2025Updated last year
Infinity-INF / fast-phasr
View on GitHub
Phonemes and durations labeling based on whisper small
☆11Jul 7, 2024Updated 2 years ago
Trae1ounG / Pretrain_Space_RLVR
View on GitHub
[arxiv: 2604.14142] From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space
☆17Apr 16, 2026Updated 3 months ago
RUCAIBox / Passk_Training
View on GitHub
The official repository of paper "Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models''
☆113Aug 15, 2025Updated 11 months ago
SalesforceAIResearch / Elastic-Reasoning
View on GitHub
Make reasoning models scalable
☆51Jun 2, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
deeplearning-wisc / picle
View on GitHub
Official code for ICML 2024 paper on Persona In-Context Learning (PICLe)
☆28Jun 27, 2024Updated 2 years ago
shashankshirol / GeneratingNoisySpeechData
View on GitHub
A repository comprising of code for generation of noisy speech data from clean data using deep learning methods
☆16Jul 12, 2021Updated 5 years ago
spatigen / milr
View on GitHub
Official code of paper: MILR: Improving Multimodal Image Generation via Test-Time Latent Reasoning
☆18Feb 12, 2026Updated 5 months ago
facebookresearch / ToolVerifier
View on GitHub
This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.
☆23Mar 11, 2024Updated 2 years ago
lihongcs / LLM_Inception
View on GitHub
[ICLR 2025] This repo is the official implementation of "The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs".
☆13Jan 25, 2025Updated last year
zjunlp / LightThinker
View on GitHub
[EMNLP 2025] LightThinker: Thinking Step-by-Step Compression
☆165Jun 22, 2026Updated last month
Top34051 / stargan-zsvc
View on GitHub
Unofficial PyTorch Implementation of StarGAN-ZSVC
☆14Aug 5, 2021Updated 4 years ago
DanjieTang / Diffusion-CIFAR10
View on GitHub
Training diffusion model with CIFAR10 dataset(insight from 13 papers)
☆16Jun 29, 2026Updated last month
CODEJIN / XiaoiceSing2
View on GitHub
☆19Feb 2, 2023Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
kokolerk / TON
View on GitHub
[NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models
☆58Sep 29, 2025Updated 10 months ago
penghao-wu / GUI_Reflection
View on GitHub
☆34Sep 19, 2025Updated 10 months ago
inclusionAI / Ming-Freeform-Audio-Edit
View on GitHub
☆15Oct 27, 2025Updated 9 months ago
mayug / 0-shot-llm-vision
View on GitHub
This repository contains the code for our CVPR 2024 paper,
☆16Aug 27, 2024Updated last year
StarWalkin / UI-NEXUS
View on GitHub
This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…
☆14Jul 27, 2025Updated last year
MBZUAI-CLeaR / IoE-Prompting
View on GitHub
☆11Feb 28, 2024Updated 2 years ago
lampts / wsdm19cup
View on GitHub
My 1st place solution at WSDM 2019 cup for fake news classification
☆45Feb 16, 2020Updated 6 years ago
Hesse73 / RLVR-Directions
View on GitHub
Source Code for our ICLR'26 paper
☆17Feb 22, 2026Updated 5 months ago
RUCAIBox / FIGA
View on GitHub
[ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"
☆10May 5, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
meituan-longcat / MineExplorer
View on GitHub
Reproduction code for paper "MineExplorer: Evaluating Open-World Exploration of MLLM Agents in Minecraft"
☆18Jun 12, 2026Updated last month
zli12321 / Vision-SR1
View on GitHub
Reinforcement Learning of Vision Language Models with Self Visual Perception Reward
☆175Mar 14, 2026Updated 4 months ago
open-compass / CIBench
View on GitHub
Official Repo of "CIBench: Evaluation of LLMs as Code Interpreter "
☆15Jul 19, 2024Updated 2 years ago
s-vco / s-vco
View on GitHub
Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images
☆19Jun 4, 2025Updated last year
WenkeHuang / SDEA
View on GitHub
ICML 2024 - Self-Driven Entropy Aggregation for Byzantine-Robust Heterogeneous Federated Learning
☆10Jul 16, 2024Updated 2 years ago
intel / TVP
View on GitHub
☆15Aug 4, 2025Updated 11 months ago
UesugiErii / tf2-PPO-atari
View on GitHub
Use tensorflow2 achieve PPO to play atari game
☆13Oct 25, 2019Updated 6 years ago