modelbased / minirllab
Mini RL Lab
☆14Updated 3 months ago
Related projects: ⓘ
- Various reinforcement learning algorithms written in Jax + Flax☆21Updated last year
- This code accompanies the paper "Scalable Multi-Agent Model-Based Reinforcement Learning".☆46Updated last year
- Datasets with baselines for offline multi-agent reinforcement learning.☆125Updated this week
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆46Updated last year
- Challenging Memory-based Deep Reinforcement Learning Agents☆76Updated 4 months ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆53Updated 3 months ago
- Skeleton for scalable and flexible Jax RL implementations☆58Updated last year
- JAX implementation of RL algorithms and vectorized environments☆32Updated 8 months ago
- The Starcraft Multi-Agent challenge lite☆32Updated last week
- ☆61Updated 9 months ago
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game☆37Updated last year
- Extreme Q-Learning: Max Entropy RL without Entropy☆78Updated last year
- A project that provides help for using DeepMind's mctx on gym-style environments.☆46Updated 5 months ago
- Adaptable tools to make reinforcement learning and evolutionary computation algorithms.☆53Updated 2 years ago
- ☆40Updated 2 months ago
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆51Updated last month
- Synthetic Experience Replay☆62Updated 3 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆87Updated last month
- Learning diverse options through the Laplacian representation.☆22Updated 8 months ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…☆33Updated last week
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆50Updated 11 months ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆137Updated 3 months ago
- A tool for aggregating and plotting MARL experiment data.☆57Updated 3 weeks ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆51Updated 5 months ago
- Prioritized Experience Replay implementation with proportional prioritization☆67Updated last year
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆49Updated 8 months ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆91Updated last year
- Robust Reinforcement Learning Suite☆18Updated 3 months ago
- Reinforcement learning training framework for entity-gym environments.☆14Updated 6 months ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆96Updated 2 years ago