fuxiAIlab / xrl-bench
☆17 Updated 6 months ago
Alternatives and similar repositories for xrl-bench:
Users who are interested in xrl-bench are comparing it to the libraries listed below:
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆76 Updated last month
- Official repository of the paper "FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning" ☆18 Updated 6 months ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning. ☆32 Updated 7 months ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022) ☆66 Updated 2 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems" ☆53 Updated 6 months ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier" ☆22 Updated last year
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination ☆26 Updated 2 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral) ☆60 Updated last year
- ☆26 Updated last year
- ☆41 Updated last year
- Model-Based Offline Reinforcement Learning ☆48 Updated 4 years ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon" ☆43 Updated 7 months ago
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023 ☆30 Updated last month
- Benchmarked implementations of Offline RL Algorithms. ☆68 Updated last week
- ☆27 Updated 2 years ago
- Reinforcement Learning via Supervised Learning ☆69 Updated 2 years ago
- Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021) ☆20 Updated 3 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition" ☆19 Updated 2 years ago
- ☆32 Updated 6 months ago
- ☆17 Updated 2 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆24 Updated last year
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View. ☆23 Updated 9 months ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg… ☆44 Updated last year
- Code for NeurIPS 2021 paper "Offline Reinforcement Learning with Reverse Model-based Imagination" ☆19 Updated 3 years ago
- ☆54 Updated 11 months ago
- Related papers for offline reinforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL) ☆18 Updated 2 years ago
- ☆41 Updated 3 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL) ☆25 Updated 3 years ago
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023 ☆28 Updated last year
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization ☆32 Updated 2 years ago