david-lindner / safe-grid-gymLinks
A gym interface for AI safety gridworlds created in pycolab.
☆18Updated 3 years ago
Alternatives and similar repositories for safe-grid-gym
Users that are interested in safe-grid-gym are comparing it to the libraries listed below
Sorting:
- ☆324Updated last year
- A toolbox with the goal of speeding up research on bargaining in MARL (cooperation problems in MARL).☆32Updated 3 years ago
- Official implementation of the δ-model presented in the ICML 2024 paper "A Distributional Analogue to the Successor Representation".☆23Updated last year
- Partially Observable Process Gym☆211Updated 7 months ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆50Updated last year
- Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL☆24Updated 3 years ago
- Gridworld for MARL experiments☆144Updated 4 years ago
- Benchmarking RL generalization in an interpretable way.☆174Updated last month
- Gridworld domains in the gym interface☆29Updated last year
- A tool for aggregating and plotting MARL experiment data.☆80Updated 11 months ago
- PAIRED in PyTorch 🔥☆64Updated 2 years ago
- Object Centric Atari games☆96Updated last month
- OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.☆62Updated 3 years ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆20Updated 3 years ago
- Simple gym environments for safety in Reinforcement Learning Research☆18Updated last year
- ☆18Updated 2 years ago
- Nethack Learning Environment Wrapper for Language Interface☆41Updated 2 years ago
- This is a minimal example to demonstrate how multi-agent reinforcement learning with differentiable communication channels and centralize…☆43Updated 2 years ago
- ☆248Updated last year
- Learning Laplacian Representations in Reinforcement Learning☆18Updated 5 years ago
- SocialJax: sequential social dilemma environments☆60Updated last month
- ☆47Updated last year
- ⚡ Flashbax: Accelerated Replay Buffers in JAX☆268Updated 3 months ago
- ☆27Updated 10 months ago
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆31Updated 5 years ago
- Project on Successor Features in Deep Reinforcement Learning and Transfer Learning☆24Updated 7 years ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆342Updated last year
- ☆29Updated last year
- 🏆 gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Compu…☆216Updated 4 years ago
- Benchmarking the Spectrum of Agent Capabilities☆507Updated last year