SuReLI / dyna-gym
This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.
☆31 · Updated 6 years ago
Alternatives and similar repositories for dyna-gym
Users who are interested in dyna-gym are comparing it to the libraries listed below.
- Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729) ☆76 · Updated 5 years ago
- Code for experimenting with state and action abstractions in reinforcement learning. ☆31 · Updated 4 years ago
- Adapting the AlphaZero algorithm to remove the need for execution traces to train NPI. ☆79 · Updated last year
- Modifiable OpenAI Gym environments for studying generalization in RL ☆87 · Updated 6 years ago
- ☆44 · Updated 6 years ago
- Implementation of Fast Efficient Hyperparameter Tuning for Policy Gradient Methods (https://arxiv.org/abs/1902.06583) ☆19 · Updated 5 years ago
- Using RLLib and PycoLab to explore intelligent cooperative behavior in sequential social dilemmas ☆49 · Updated 2 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals". ☆61 · Updated last year
- Simple tools for statistical analyses in RL experiments ☆66 · Updated 6 years ago
- Code for the paper "Learning Human Objectives by Evaluating Hypothetical Behavior" ☆84 · Updated 5 years ago
- PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098) ☆95 · Updated 6 years ago
- Benchmark environments for reward modelling and imitation learning algorithms. ☆46 · Updated last year
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆86 · Updated 3 years ago
- Continual Reinforcement Learning in 3D Non-stationary Environments ☆38 · Updated 5 years ago
- Code release for Learning with Opponent-Learning Awareness and variations. ☆148 · Updated 2 years ago
- MultiTask Environments for Reinforcement Learning. ☆75 · Updated 2 years ago
- General implementation of Advantage Actor Critic using PyTorch ☆27 · Updated 3 years ago
- Hierarchical Self-Play ☆21 · Updated 6 years ago
- Reward Learning by Simulating the Past ☆44 · Updated 6 years ago
- ☆47 · Updated 4 years ago
- Building Agents with Imagination: PyTorch step-by-step implementation ☆208 · Updated 6 years ago
- ☆83 · Updated 4 years ago
- Options of Interest: Temporal Abstraction with Interest Functions (AAAI 2020) ☆25 · Updated 4 years ago
- Implementation of the Advantage Actor Critic (A2C) and Proximal Policy Optimization (PPO) algorithms, taking advantage of TensorFlow 2.x. ☆9 · Updated 5 years ago
- On the pitfalls of measuring emergent communication ☆34 · Updated 6 years ago
- A collection of multi-agent reinforcement learning OpenAI Gym environments ☆45 · Updated 4 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm ☆44 · Updated 6 years ago
- Convert DeepMind Control Suite to OpenAI Gym environments. ☆86 · Updated 5 years ago
- A framework for experimenting with never-ending learning ☆79 · Updated 7 months ago
- ☆28 · Updated 2 years ago