DistRL-lab / distrl-open
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents
☆19 · Updated last month
Alternatives and similar repositories for distrl-open:
Users who are interested in distrl-open are comparing it to the libraries listed below:
- SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation ☆29 · Updated last month
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents ☆35 · Updated 11 months ago
- Official code for ICML 2024 paper, "RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences" (ICML 2024 Spotlight) ☆28 · Updated 5 months ago
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model ☆32 · Updated 11 months ago
- A large-scale multi-modal pre-trained model ☆131 · Updated 2 years ago
- Implementation of TWOSOME ☆69 · Updated 2 months ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning ☆328 · Updated 3 months ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning". ☆44 · Updated 11 months ago
- ICML'2024: HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning ☆15 · Updated 9 months ago
- NeurIPS2022: Constrained Update Projection Approach to Safe Policy Optimization ☆12 · Updated last year
- [ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model" ☆93 · Updated last month
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted) ☆161 · Updated last year
- A repo for RLHF training and BoN over LLMs, with support for reward model ensembles. ☆41 · Updated 2 months ago
- ICLR'2024: Learning Multi-Agent Communication from Graph Modeling Perspective ☆28 · Updated last year
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024… ☆33 · Updated 4 months ago
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models ☆35 · Updated 4 months ago
- Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method ☆39 · Updated 6 months ago
- [NeurIPS 2022] Official codebase for "Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learn… ☆20 · Updated last month
- Official code repository for Prompt-DT. ☆107 · Updated 2 years ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1) ☆29 · Updated 3 months ago
- ELIGN: Expectation Alignment as a Multi-agent Intrinsic Reward ☆18 · Updated 2 years ago
- A RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM ☆71 · Updated 7 months ago
- EDIS: Energy-guided DIffusion Sampling ☆9 · Updated 7 months ago