DistRL-lab / distrl-openLinks
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents
☆25Updated 4 months ago
Alternatives and similar repositories for distrl-open
Users that are interested in distrl-open are comparing it to the libraries listed below
Sorting:
- ☆18Updated last month
- SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation☆38Updated last week
- Improving Math reasoning through Direct Preference Optimization with Verifiable Pairs☆14Updated 4 months ago
- Official implementation of the NeurIPS 2024 paper CORY☆17Updated 4 months ago
- ☆14Updated 9 months ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆34Updated last year
- Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24.☆11Updated 4 months ago
- Benchmarking LLMs' Gaming Ability in Multi-Agent Environments☆83Updated 2 months ago
- ☆25Updated last month
- Implementation of TWOSOME☆77Updated 6 months ago
- ☆20Updated last month
- Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"☆38Updated 5 months ago
- Official Code For: {DLPO : Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Learning Perspective}☆9Updated 3 months ago
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆182Updated 3 months ago
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆251Updated last week
- A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆372Updated last year
- ICLR 2025 Agent-Related Papers☆71Updated 8 months ago
- An index of algorithms for reinforcement learning from human feedback (rlhf))☆92Updated last year
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆129Updated this week
- Reinforced Multi-LLM Agents training☆30Updated last month
- Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"☆53Updated last month
- This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab…☆38Updated last month
- Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"☆120Updated last month
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.☆366Updated 4 months ago
- Rewarded soups official implementation☆58Updated last year
- ☆21Updated last month
- ☆241Updated last month
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆372Updated 7 months ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆35Updated last year
- ☆148Updated last week