aadharna / UntouchableThunderLinks
Co-evolution of agents and environments in GVG-AI
☆18Updated 4 years ago
Alternatives and similar repositories for UntouchableThunder
Users that are interested in UntouchableThunder are comparing it to the libraries listed below
Sorting:
- AGAC: Adversarially Guided Actor-Critic☆48Updated 3 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆105Updated 3 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆48Updated 4 years ago
- Vectorization techniques for fast population-based training.☆56Updated 3 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆102Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆69Updated 4 years ago
- Generalised UDRL☆37Updated 3 years ago
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆18Updated 2 years ago
- ☆31Updated 4 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆89Updated 4 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations☆83Updated 3 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆21Updated 6 years ago
- PAIRED in PyTorch 🔥☆63Updated 2 years ago
- Reinforcement Learning with Latent Flow☆44Updated 4 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆58Updated 3 years ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …☆45Updated 3 years ago
- ☆32Updated last year
- Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)☆44Updated 2 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆63Updated last year
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆75Updated last year
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated last year
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"☆84Updated 5 years ago
- A collection of RL algorithms written in JAX.☆103Updated 3 years ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆13Updated last year
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions☆68Updated last year
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆67Updated 5 years ago
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1☆16Updated 5 years ago