instadeepai / matrax
A collection of matrix games in JAX
☆9Updated 3 months ago
Alternatives and similar repositories for matrax:
Users that are interested in matrax are comparing it to the libraries listed below
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX☆57Updated last year
- ☆18Updated last month
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆50Updated 2 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆9Updated last year
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆48Updated last year
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆17Updated 4 months ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆19Updated last year
- POPGym Library in JAX☆11Updated 10 months ago
- A tool for aggregating and plotting MARL experiment data.☆73Updated last month
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- ☆74Updated 6 months ago
- ☆73Updated 4 months ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆21Updated 3 months ago
- The Starcraft Multi-Agent challenge lite☆42Updated 6 months ago
- ☆41Updated last year
- Learning diverse options through the Laplacian representation.☆23Updated last year
- An Open-Ended Agentic Simulator☆43Updated 7 months ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated 2 years ago
- Simple JAX Graphics Library.☆34Updated 4 months ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆14Updated 2 years ago
- ☆47Updated 2 years ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆67Updated 9 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆141Updated 3 months ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆69Updated 6 months ago
- Baselines for gymnax 🤖☆66Updated last year
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆15Updated 10 months ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆22Updated last year
- Conservative Q learning in Jax☆53Updated 2 years ago
- ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch)☆25Updated 4 months ago