DistRL-lab / distrl-open
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents
☆22 · Updated 2 months ago
Alternatives and similar repositories for distrl-open
Users who are interested in distrl-open are comparing it to the repositories listed below.
- ☆12 · Updated 2 months ago
- SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation ☆35 · Updated 3 weeks ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents ☆35 · Updated last year
- Improving Math reasoning through Direct Preference Optimization with Verifiable Pairs ☆9 · Updated last month
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning". ☆47 · Updated last year
- Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24. ☆10 · Updated 2 months ago
- ICML'2024: HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning ☆17 · Updated 11 months ago
- Code for our paper LLaMAR: LM-based Long-Horizon Planner for Multi-Agent Robotics ☆13 · Updated 3 months ago
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models ☆39 · Updated last month
- ☆21 · Updated 8 months ago
- ☆58 · Updated 2 months ago
- ☆15 · Updated 6 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024… ☆36 · Updated 5 months ago
- ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning ☆26 · Updated 4 months ago
- ☆78 · Updated last year
- ☆11 · Updated last year
- This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab… ☆35 · Updated last month
- Implementation of TWOSOME ☆72 · Updated 4 months ago
- Direct preference optimization with f-divergences. ☆13 · Updated 6 months ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning" ☆113 · Updated last week
- ☆29 · Updated last year
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model ☆41 · Updated last year
- [ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model" ☆99 · Updated 3 months ago
- ☆28 · Updated last week
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL" ☆167 · Updated last month
- 📖 Full Stack Practice of the Large Language Model Training @ RLChina 2024 ☆39 · Updated 7 months ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1) ☆29 · Updated 5 months ago
- ☆17 · Updated 7 months ago
- Natural Language Reinforcement Learning ☆87 · Updated 4 months ago
- ICLR'2024: Learning Multi-Agent Communication from Graph Modeling Perspective ☆31 · Updated last year