fuxiAIlab / xrl-bench
☆21 · Updated last year
Alternatives and similar repositories for xrl-bench
Users who are interested in xrl-bench are comparing it to the libraries listed below.
- Model-Based Offline Reinforcement Learning ☆52 · Updated 5 years ago
- Benchmarked implementations of Offline RL Algorithms. ☆76 · Updated 11 months ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning. ☆40 · Updated 11 months ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆104 · Updated 3 years ago
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023 ☆33 · Updated last year
- Re-implementations of SOTA RL algorithms. ☆136 · Updated 2 years ago
- ☆15 · Updated 4 years ago
- V-MPO torch version with DMLab30 and GTrXL ☆13 · Updated 4 years ago
- Official repository of the paper "FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning" ☆34 · Updated last year
- Code for FOCAL Paper Published at ICLR 2021 ☆55 · Updated 2 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy ☆87 · Updated 2 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning" ☆57 · Updated 2 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination ☆26 · Updated 3 years ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon" ☆51 · Updated last year
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO). ☆94 · Updated 2 years ago
- Challenging Memory-based Deep Reinforcement Learning Agents ☆109 · Updated last year
- ☆18 · Updated 3 years ago
- Paper Collection for Batch RL with brief introductions. ☆84 · Updated 3 years ago
- Synthetic Experience Replay ☆107 · Updated last year
- ☆60 · Updated 3 years ago
- Implementation of Trajectory Transformer with attention caching and batched beam search ☆116 · Updated 2 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022) ☆70 · Updated 3 years ago
- Code for MOPO: Model-based Offline Policy Optimization ☆191 · Updated 3 years ago
- ☆48 · Updated 2 months ago
- ☆31 · Updated 3 years ago
- Fast and flexible multi-agent gridworld reinforcement learning environments. ☆51 · Updated 10 months ago
- Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan … ☆72 · Updated 3 years ago
- ☆44 · Updated 4 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier" ☆27 · Updated 2 years ago
- An open source benchmark for Multi Agent Reinforcement Learning ☆31 · Updated 2 years ago