DistRL-lab / distrl-openLinks

DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents

☆25

Alternatives and similar repositories for distrl-open

Users that are interested in distrl-open are comparing it to the libraries listed below

Sorting:

ai-agents-2030 / DistRL-open
☆18Updated last month
ai-agents-2030 / SPA-Bench
SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
☆38Updated last week
TU2021 / DPO-VP
Improving Math reasoning through Direct Preference Optimization with Verifiable Pairs
☆14Updated 4 months ago
Harry67Hu / CORY
Official implementation of the NeurIPS 2024 paper CORY
☆17Updated 4 months ago
todexter3 / Richelieu
☆14Updated 9 months ago
OpenDFM / Rememberer
[NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents
☆34Updated last year
nicoladainese96 / code-world-models
Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24.
☆11Updated 4 months ago
CUHK-ARISE / GAMABench
Benchmarking LLMs' Gaming Ability in Multi-Agent Environments
☆83Updated 2 months ago
heaplax / ARMAP
☆25Updated last month
WeihaoTan / TWOSOME
Implementation of TWOSOME
☆77Updated 6 months ago
PKU-Alignment / eval-anything
☆20Updated last month
YangRui2015 / Generalizable-Reward-Model
Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"
☆38Updated 5 months ago
sfasfaffa / DLPO
Official Code For: {DLPO : Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Learning Perspective}
☆9Updated 3 months ago
YifeiZhou02 / ArCHer
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
☆182Updated 3 months ago
PRIME-RL / Entropy-Mechanism-of-RL
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
☆251Updated last week
123penny123 / Awesome-LLM-RL
A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.
☆372Updated last year
Aaron617 / ICLR-2025-Submissions-Agent
ICLR 2025 Agent-Related Papers
☆71Updated 8 months ago
louieworth / awesome-rlhf
An index of algorithms for reinforcement learning from human feedback (rlhf))
☆92Updated last year
CJReinforce / PURE
Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"
☆129Updated this week
ziyuwan / ReMA-public
Reinforced Multi-LLM Agents training
☆30Updated last month
Reallm-Labs / InfiGUI-R1
Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"
☆53Updated last month
sjtu-marl / DPT-Agent
This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab…
☆38Updated last month
lll6gg / UI-R1
Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"
☆120Updated last month
DigiRL-agent / digirl
Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.
☆366Updated 4 months ago
alexrame / rewardedsoups
Rewarded soups official implementation
☆58Updated last year
ASTRAL-Group / data-efficient-llm-rl
☆21Updated last month
GAIR-NLP / ToRL
☆241Updated last month
RL4VLM / RL4VLM
Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
☆372Updated 7 months ago
Jiacheng-Zhu-AIML / AsymmetryLoRA
Preprint: Asymmetry in Low-Rank Adapters of Foundation Models
☆35Updated last year
MASWorks / MASLab
☆148Updated last week