fuxiAIlab / xrl-benchLinks
☆20Updated last year
Alternatives and similar repositories for xrl-bench
Users that are interested in xrl-bench are comparing it to the libraries listed below
Sorting:
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms.☆75Updated 6 months ago
- Code for FOCAL Paper Published at ICLR 2021☆51Updated last year
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023☆33Updated 9 months ago
- Official repository of the paper "FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning"☆33Updated last year
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆38Updated 7 months ago
- Fast and flexible multi-agent gridworld reinforcement learning environments.☆44Updated 6 months ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆67Updated 3 years ago
- Model-Based Offline Reinforcement Learning☆51Updated 4 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆65Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆62Updated 2 years ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆35Updated 2 years ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆103Updated 11 months ago
- V-MPO torch version with DMLab30 and GTrXL☆13Updated 4 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- ☆237Updated 10 months ago
- Implementation of Trajectory Transformer with attention caching and batched beam search☆116Updated 2 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆115Updated 3 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆100Updated 3 years ago
- Learning to Incentivize Other Learning Agents☆34Updated 3 years ago
- Pytorch Implementation for First Order Constrained Optimization in Policy Space (FOCOPS).☆29Updated 3 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆189Updated 3 years ago
- RLA is a tool for managing your RL experiments automatically☆71Updated 2 years ago
- Benchmarking RL generalization in an interpretable way.☆162Updated 3 months ago
- Paper Collection for Batch RL with brief introductions.☆84Updated 3 years ago
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆76Updated 2 years ago
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆18Updated last year
- Re-implementations of SOTA RL algorithms.☆135Updated 2 years ago
- Synthetic Experience Replay☆103Updated last year
- Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)☆23Updated 3 years ago