kinalmehta / marl-jaxLinks
JAX library for MARL research
β86Updated last year
Alternatives and similar repositories for marl-jax
Users that are interested in marl-jax are comparing it to the libraries listed below
Sorting:
- Baselines for gymnax π€β66Updated 2 years ago
- JAX implementations of core Deep RL algorithmsβ79Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRLβ112Updated 9 months ago
- A collection of RL algorithms written in JAX.β98Updated 2 years ago
- Simple single-file baselines for Q-Learning in pure-GPU settingβ167Updated 2 months ago
- Accelerated minigrid environments with JAXβ138Updated 3 weeks ago
- Contains JAX implementation of algorithms for inverse reinforcement learningβ73Updated 9 months ago
- Benchmarking RL generalization in an interpretable way.β156Updated 2 months ago
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papersβ52Updated 2 years ago
- β45Updated 2 years ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"β48Updated 11 months ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselinβ¦β53Updated 3 weeks ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"β100Updated 3 years ago
- A tool for aggregating and plotting MARL experiment data.β77Updated 4 months ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objectiveβ80Updated 2 years ago
- β79Updated 2 months ago
- Accelerated replay buffers in JAXβ41Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weightsβ¦β56Updated 2 years ago
- Gridworld domains in the gym interfaceβ28Updated 8 months ago
- Partially Observable Process Gymβ190Updated 11 months ago
- β‘ Flashbax: Accelerated Replay Buffers in JAXβ236Updated 2 months ago
- β101Updated last year
- Codebase for the Graph-based Policy Learning algorithm, which is designed for learning policies to solve the open ad hoc teamwork problemβ¦β34Updated 4 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]β101Updated last year
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the β¦β86Updated 3 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according β¦β35Updated last year
- Library to compare and evaluate reward functionsβ67Updated last year
- PAIRED in PyTorch π₯β60Updated 2 years ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorchβ52Updated 2 years ago
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARLβ42Updated 8 months ago