WillDreamer/ARL-Arena

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/WillDreamer/ARL-Arena)

WillDreamer / ARL-Arena

[ICML2026] ARLArena

☆90

Alternatives and similar repositories for ARL-Arena

Users that are interested in ARL-Arena are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Gen-Verse / Open-AgentRL
View on GitHub
RLAnything (ICML 2026) & AutoTool (ICML 2026), DemyAgent: Open-Source RL for LLMs and Agentic Scenarios
☆595Jun 12, 2026Updated last month
WillDreamer / T2PO
View on GitHub
【ICML2026 Spotlight】 T2PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning
☆51May 27, 2026Updated 2 months ago
applese233 / ICRL
View on GitHub
In-Context Reinforcement Learning for Tool Use in Large Language Models
☆48Mar 26, 2026Updated 4 months ago
hkust-nlp / model-task-align-rl
View on GitHub
[ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".
☆18Feb 9, 2026Updated 5 months ago
DripNowhy / Octopus
View on GitHub
[ICML 2026] Official implementation for paper: Learning Self-Correction in Vision–Language Models via Rollout Augmentation
☆16Jun 4, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
horizon-llm / OpenKimi
View on GitHub
[ICML2026] Reproduce Kimi K1.5/K2 RL algorithm and rollout system
☆19Apr 9, 2026Updated 3 months ago
OoDBag / VisTA
View on GitHub
VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection
☆27May 31, 2025Updated last year
MTU-Bench-Team / MTU-Bench
View on GitHub
MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models
☆60Jul 24, 2025Updated last year
mit-han-lab / vcpo
View on GitHub
[ICML 2026] Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs
☆29Apr 27, 2026Updated 3 months ago
OS-Copilot / OS-Symphony
View on GitHub
[ACL 2026 Main] Official repository for paper: OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agents
☆48Apr 7, 2026Updated 3 months ago
HypherX / Evolution-Analysis
View on GitHub
☆25Dec 13, 2024Updated last year
mandyyyyii / east
View on GitHub
☆19Aug 4, 2025Updated 11 months ago
pUmpKin-Co / ComplementaryRL
View on GitHub
Co-evolving policy actors and experience extractors for efficient experience-driven agent RL
☆51May 12, 2026Updated 2 months ago
muhaochen / box_embedding_paper_list
View on GitHub
A paper list for box embeddings
☆17Jun 9, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
rycolab / kl-rb
View on GitHub
This repository contains code for the paper "Better Estimation of the KL Divergence Between Language Models"
☆19May 30, 2025Updated last year
CoopReason / TESSY
View on GitHub
A Teacher–Student Cooperation Framework to Synthesize Student-Consistent SFT Data
☆35May 1, 2026Updated 2 months ago
zhaosnw / evo_mem
View on GitHub
☆18Dec 21, 2025Updated 7 months ago
songmzhang / DSKDv2
View on GitHub
The official implementation of the paper "A Dual-Space Framework for General Knowledge Distillation of Large Language Models".
☆18Jan 4, 2026Updated 6 months ago
FreedomIntelligence / MyPhoneBench
View on GitHub
MyPhoneBench: Do Phone-Use Agents Respect Your Privacy?
☆24Apr 3, 2026Updated 3 months ago
ls-kelvin / REVPT
View on GitHub
Code for paper: Reinforced Vision Perception with Tools
☆74Oct 3, 2025Updated 9 months ago
OswaldHe / HMT-pytorch
View on GitHub
[NAACL 2025] Official Implementation of "HMT: Hierarchical Memory Transformer for Long Context Language Processing"
☆80Mar 12, 2026Updated 4 months ago
Shawn-Guo-CN / Lossless_Text_Compression_with_Transformer
View on GitHub
This repo is to demo the concept of lossless compression with Transformers as encoder and decoder.
☆14May 2, 2024Updated 2 years ago
transcend-0 / VibeQuant
View on GitHub
VibeQuant: Your Personal Quant Research Workbench
☆19Jul 9, 2026Updated 2 weeks ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
OpenMOSS / DiRL
View on GitHub
☆165Mar 30, 2026Updated 3 months ago
Phantivia / OpenAI_PTCompletion
View on GitHub
A Parallel Completion Python Library that boosts your OpenAI-API query with task queue & multiprocessing.
☆26May 15, 2023Updated 3 years ago
HITsz-TMG / VisionGraph
View on GitHub
The benchmark and datasets of the ICML 2024 paper "VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual C…
☆17May 27, 2024Updated 2 years ago
naver / debit
View on GitHub
☆16Jul 24, 2024Updated 2 years ago
keikeiqi / MGTTA
View on GitHub
AAAI2025
☆13Apr 18, 2025Updated last year
MiroMindAI / MiroEval
View on GitHub
MiroEval: A benchmark and evaluation framework for deep research agents — 100 tasks (70 text, 30 multimodal) assessed across synthesis qu…
☆46Jul 6, 2026Updated 3 weeks ago
roychowdhuryresearch / gsw-memory
View on GitHub
Code corresponding to Generative Semantic Workspaces - Long term Structured Memory for Large Language Models - AAAI 26 (Oral), ICML 26
☆22Jun 2, 2026Updated last month
HHHHHejia / Awesome-AgenticLLM-RL-Papers
View on GitHub
☆1,849Jun 18, 2026Updated last month
Wloner0809 / ustc-course-resource
View on GitHub
a repo for sharing ustc course resources
☆37Sep 21, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
HJSang / OPSD_OnPolicyDistillation
View on GitHub
On Policy Distillation Build on top of Verl
☆92May 25, 2026Updated 2 months ago
agents-x-project / PyVision-RL
View on GitHub
[ICML 2026] Official implementation of "PyVision-RL: Forging Open Agentic Vision Models via RL."
☆70Feb 25, 2026Updated 5 months ago
pettingllms-ai / PettingLLMs
View on GitHub
[ICLR'26] Stronger-MAS: A RL Framework for multi LLM agent system; [arxiv] MetaAgent-X: End-to-End Reinforcement Learning Automatic Mult…
☆206May 15, 2026Updated 2 months ago
yxzwang / FamilyTool
View on GitHub
FamilyTool benchmark
☆14Sep 10, 2025Updated 10 months ago
qwenpilot / FIPO
View on GitHub
This code implements the algorithm of FIPO, a value-free RL recipe for eliciting deeper reasoning from a clean base model.
☆130Apr 7, 2026Updated 3 months ago
TuringEyeTest / TuringEyeTest
View on GitHub
Pixels, Patterns, but no Poetry: To See the World like Humans
☆18Aug 11, 2025Updated 11 months ago
ha0ransun / Path-Auxiliary-Sampler
View on GitHub
☆10Feb 22, 2023Updated 3 years ago