DistRL-lab / distrl-openLinks
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents
☆24Updated 5 months ago
Alternatives and similar repositories for distrl-open
Users that are interested in distrl-open are comparing it to the libraries listed below
Sorting:
- ☆21Updated 7 months ago
- SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation☆54Updated 6 months ago
- DART-GUI: Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data Curation☆34Updated 2 weeks ago
- Improving Math reasoning through Direct Preference Optimization with Verifiable Pairs☆18Updated 9 months ago
- [ICML 2025] "From Debate to Equilibrium: Belief-Driven Multi-Agent LLM Reasoning via Bayesian Nash Equilibrium"☆33Updated last month
- Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24.☆18Updated 10 months ago
- A comprehensive framework for benchmarking single and multi-agent systems across a wide range of tasks—evaluating performance, accuracy, …☆35Updated 2 months ago
- Code and data for the paper: Competing Large Language Models in Multi-Agent Gaming Environments☆91Updated 3 weeks ago
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆407Updated 6 months ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆386Updated last month
- Official implementation of the NeurIPS 2024 paper CORY☆26Updated 2 weeks ago
- [AAAI-2026] Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"☆142Updated last month
- [ICML 2025] Official Implementation of GLIDER☆72Updated 3 months ago
- ☆25Updated 7 months ago
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆395Updated 3 months ago
- Training VLM agents with multi-turn reinforcement learning☆365Updated last week
- ☆208Updated 5 months ago
- Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"☆46Updated 10 months ago
- Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents☆211Updated 8 months ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆148Updated 2 months ago
- ICLR 2025 Agent-Related Papers☆74Updated last year
- AdaRFT: Efficient Reinforcement Finetuning via Adaptive Curriculum Learning☆50Updated 6 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆94Updated last year
- Building a comprehensive and handy list of papers for GUI agents☆592Updated 2 months ago
- Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"☆78Updated 7 months ago
- ☆21Updated 5 months ago
- This my attempt to create Self-Correcting-LLM based on the paper Training Language Models to Self-Correct via Reinforcement Learning by g…☆38Updated 6 months ago
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.☆387Updated 10 months ago
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet☆231Updated last month
- End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆348Updated 3 months ago