DistRL-lab / distrl-open
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents
☆21Updated last month
Alternatives and similar repositories for distrl-open:
Users that are interested in distrl-open are comparing it to the libraries listed below
- SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation☆30Updated 2 months ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆35Updated 11 months ago
- ☆14Updated 6 months ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".☆46Updated last year
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models☆37Updated 3 weeks ago
- A large-scale multi-modal pre-trained model☆131Updated 2 years ago
- ☆11Updated last year
- ☆21Updated 8 months ago
- ☆55Updated last month
- ☆27Updated last year
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆74Updated 2 years ago
- ☆76Updated last year
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆58Updated 2 years ago
- ☆23Updated 6 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆163Updated last year
- ☆30Updated 2 years ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆36Updated 5 months ago
- code for the paper Offline Prioritized Experience Replay☆13Updated last year
- ☆46Updated 2 years ago
- Direct preference optimization with f-divergences.☆13Updated 5 months ago
- Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"☆32Updated 2 months ago
- Official code repository for Prompt-DT.☆109Updated 2 years ago
- ICML'2024: HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning☆17Updated 10 months ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆98Updated last year
- ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning☆26Updated 3 months ago
- This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. P…☆38Updated 3 months ago
- Natural Language Reinforcement Learning☆87Updated 4 months ago
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆62Updated 10 months ago
- ☆13Updated 5 months ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆28Updated last year