ASTRAL-Group / AlphaOneLinks

[EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

☆85

Alternatives and similar repositories for AlphaOne

Users that are interested in AlphaOne are comparing it to the libraries listed below

Sorting:

yihedeng9 / OpenVLThinker
OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement
☆119Updated 3 months ago
Gabesarch / ICAL
☆52Updated 6 months ago
callsys / GMPO
Geometric-Mean Policy Optimization
☆92Updated this week
xufangzhi / Genius
[ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework
☆71Updated 5 months ago
JieyuZ2 / TaskMeAnything
[NeurIPS 2024] A task generation and model evaluation system for multimodal language models.
☆73Updated 11 months ago
RifleZhang / LLaVA-Reasoner-DPO
☆99Updated 10 months ago
kokolerk / TON
[NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models
☆48Updated last month
yunfeixie233 / ViGaL
☆62Updated last month
OpenGVLab / ZeroGUI
ZeroGUI: Automating Online GUI Learning at Zero Human Cost
☆101Updated 4 months ago
TIGER-AI-Lab / VL-Rethinker
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]
☆166Updated 5 months ago
agents-x-project / PyVision
[MTI-LLM@NeurIPS 2025] Official implementation of "PyVision: Agentic Vision with Dynamic Tooling."
☆134Updated 4 months ago
chengyou-jia / AgentStore
[ACL 2025] AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant
☆41Updated 11 months ago
Xuekai-Zhu / FlowRL
☆108Updated last week
dvlab-research / ARPO
Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay
☆137Updated 5 months ago
TIGER-AI-Lab / Hierarchical-Reasoner
Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning
☆48Updated 3 weeks ago
VisualWebBench / VisualWebBench
Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"
☆60Updated last year
RUCAIBox / Virgo
Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*
☆109Updated 5 months ago
Wang-ML-Lab / multimodal-needle-in-a-haystack
[NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models
☆50Updated 6 months ago
LAMDASZ-ML / Self-Backtracking
☆51Updated 9 months ago
SalesforceAIResearch / LATTE
☆68Updated 2 months ago
zhijie-group / SIFT
SIFT: Grounding LLM Reasoning in Contexts via Stickers
☆58Updated 8 months ago
microsoft / MageBench
Official Repo for MageBench: Bridging Large Multimodal Models to Agents
☆21Updated 10 months ago
mbzuai-oryx / Agent-X
Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks
☆32Updated last week
JiuTian-VL / Optimus-1
[NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks
☆87Updated 5 months ago
yannqi / R-4B
The official repository of "R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration"
☆122Updated 2 months ago
waltonfuture / Diff-eRank
[NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models
☆54Updated 5 months ago
mathllm / MATH-V
[NeurIPS 2024] MATH-Vision dataset and code to measure multimodal mathematical reasoning capabilities.
☆120Updated 6 months ago
TEAM-ARM / arm
[NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model
☆56Updated 3 weeks ago
shulin16 / MMInA
[ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents
☆47Updated 8 months ago
microsoft / x-reasoner
X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains
☆49Updated 6 months ago