MetabrainAGI / Awaker2.5-R1

☆12

Alternatives and similar repositories for Awaker2.5-R1

Users who are interested in Awaker2.5-R1 are comparing it to the repositories listed below.

ShadeCloak / ADORA
☆47 Updated 9 months ago
hkust-nlp / Laser
[ICLR2026] Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping
☆62 Updated 8 months ago
PRIME-RL / Entropy-Mechanism-of-RL
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
☆419 Updated 6 months ago
tongjingqi / Game-RL
Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning
☆130 Updated last week
ssmisya / PRMBench
[ACL'25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.
☆88 Updated 11 months ago
hkust-nlp / mstar
[ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning
☆70 Updated 6 months ago
xiaomi-research / colar
[NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains
☆74 Updated 6 months ago
chengzu-li / MVoT
Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025)
☆67 Updated 9 months ago
xuyige / SoftCoT
ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…
☆76 Updated 8 months ago
NUS-TRAIL / NoisyRollout
[NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
☆104 Updated 4 months ago
njucckevin / MM-Self-Improve
A Self-Training Framework for Vision-Language Reasoning
☆88 Updated last year
RenShuhuai-Andy / my-tools
my commonly-used tools
☆64 Updated last year
TheRoadQaQ / ReLIFT
Official Repository of "Learning what reinforcement learning can't"
☆79 Updated last month
TIGER-AI-Lab / VL-Rethinker
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]
☆180 Updated 8 months ago
InfiMM / Awesome-Multimodal-LLM-for-Math-STEM
Paper collections of multi-modal LLM for Math/STEM/Code.
☆135 Updated 2 months ago
ThinkMorph / ThinkMorph
[ICLR 2026] The official repository for paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"
☆142 Updated last week
zhyang2226 / AR-Lopti
[ICLR 2026] Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs
☆41 Updated 8 months ago
QingyangZhang / Label-Free-RLVR
☆305 Updated 7 months ago
OpenRLHF / OpenRLHF-M
An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.
☆154 Updated last month
bigai-nlco / LatentSeek
Official Repository of LatentSeek
☆76 Updated 8 months ago
LightChen233 / M3CoT
☆88 Updated last year
TideDra / VL-RLHF
A RLHF Infrastructure for Vision-Language Models
☆195 Updated last year
Osilly / Awesome-Interleaving-Reasoning
Interleaving Reasoning: Next-Generation Reasoning Systems for AGI
☆250 Updated 3 months ago
ltzheng / SimpleTIR
[ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
☆353 Updated 3 weeks ago
OpenDCAI / Awesome_MLLMs_Reasoning
☆113 Updated 4 months ago
TianHongZXY / RLVR-Decomposed
[NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"
☆160 Updated 3 months ago
EMMA-Bench / EMMA
[ICML 2025 Oral] The official repository for the paper "Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchma…
☆69 Updated 6 months ago
ritzz-ai / GUI-R1
Official implementation of GUI-R1: A Generalist R1-Style Vision-Language Action Model For GUI Agents
☆220 Updated 9 months ago
CJReinforce / PURE
Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"
☆153 Updated 3 months ago
kiaia / GIRAFFE
Extending context length of visual language models
☆12 Updated last year