instadeepai / matraxLinks
A collection of matrix games in JAX
β12Updated 11 months ago
Alternatives and similar repositories for matrax
Users that are interested in matrax are comparing it to the libraries listed below
Sorting:
- Code for Discovered Policy Optimisation (NeurIPS 2022)β12Updated 2 years ago
- πͺ The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAXβ60Updated 2 years ago
- β85Updated last month
- Baselines for gymnax π€β72Updated 2 years ago
- β86Updated 11 months ago
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papersβ55Updated 2 years ago
- Accelerated replay buffers in JAXβ43Updated 3 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancyβ20Updated last year
- Simple single-file baselines for Q-Learning in pure-GPU settingβ187Updated 7 months ago
- Accelerated minigrid environments with JAXβ151Updated last week
- Drop-in environment replacements that make your RL algorithm train faster.β21Updated last year
- Contains JAX implementation of algorithms for inverse reinforcement learningβ72Updated last year
- Efficient baselines for autocurricula in JAX.β196Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRLβ116Updated last year
- Unified Implementations of Offline Reinforcement Learning Algorithmsβ115Updated 2 weeks ago
- An Open-Ended Agentic Simulatorβ52Updated last year
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weightsβ¦β59Updated 3 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]β110Updated last year
- β‘ Flashbax: Accelerated Replay Buffers in JAXβ256Updated last month
- β18Updated 5 months ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learningβ17Updated 3 years ago
- Challenging Memory-based Deep Reinforcement Learning Agentsβ104Updated last year
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Functionβ13Updated 2 years ago
- A tool for aggregating and plotting MARL experiment data.β79Updated 9 months ago
- Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!β243Updated last week
- JAX implementations of core Deep RL algorithmsβ82Updated 3 years ago
- Vectorization techniques for fast population-based training.β56Updated 3 years ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselinβ¦β60Updated last month
- POPGym Library in JAXβ11Updated last year
- Learning diverse options through the Laplacian representation.β23Updated last year