proroklab / popgymLinks
Partially Observable Process Gym
☆211Updated 7 months ago
Alternatives and similar repositories for popgym
Users that are interested in popgym are comparing it to the libraries listed below
Sorting:
- Benchmarking RL generalization in an interpretable way.☆174Updated 2 months ago
- ☆249Updated last year
- ☆360Updated 3 years ago
- A tool for aggregating and plotting MARL experiment data.☆82Updated last year
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆340Updated last year
- PyTorch implementation of DreamerV2 model-based RL algorithm☆236Updated 2 years ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆232Updated 2 months ago
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL☆45Updated 3 months ago
- Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch.☆321Updated 2 years ago
- This is a minimal example to demonstrate how multi-agent reinforcement learning with differentiable communication channels and centralize…☆43Updated 2 years ago
- Prioritized Experience Replay implementation with proportional prioritization☆86Updated 2 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆161Updated 2 years ago
- ☆202Updated 2 years ago
- Datasets with baselines for Offline MARL.☆201Updated 2 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆162Updated 2 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆204Updated last year
- Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)☆127Updated last year
- Baseline implementation of recurrent PPO using truncated BPTT☆160Updated last year
- Code for MOPO: Model-based Offline Policy Optimization☆191Updated 3 years ago
- Clean single-file implementation of offline RL algorithms in JAX☆165Updated 2 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆109Updated last year
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)☆198Updated 2 years ago
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆79Updated 3 years ago
- ☆308Updated 4 years ago
- ☆325Updated last year
- OpenAI Gym wrapper for the DeepMind Control Suite☆226Updated last year
- Lightweight multi-agent gridworld Gym environment☆214Updated 2 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆103Updated 3 years ago
- A collection of RL algorithms written in JAX.☆104Updated 3 years ago
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆56Updated 3 years ago