aadharna / UntouchableThunder
Co-evolution of agents and environments in GVG-AI
☆17 Updated 4 years ago
Alternatives and similar repositories for UntouchableThunder
Users interested in UntouchableThunder are comparing it to the repositories listed below.
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools ☆18 Updated 3 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration ☆69 Updated 4 years ago
- AGAC: Adversarially Guided Actor-Critic ☆47 Updated 4 years ago
- Docker containers of baseline agents for the Crafter environment ☆30 Updated 4 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆59 Updated 3 years ago
- Vectorization techniques for fast population-based training. ☆57 Updated 3 years ago
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions ☆73 Updated last year
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M … ☆44 Updated 4 years ago
- Generalised UDRL ☆37 Updated 3 years ago
- The source code for the gym-microrts paper. ☆42 Updated 3 years ago
- Curiosity-driven Exploration by Self-supervised Prediction ☆23 Updated 6 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction" ☆46 Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 Updated 5 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations ☆86 Updated 3 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals". ☆64 Updated 2 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire… ☆69 Updated 5 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation ☆40 Updated 6 months ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆92 Updated 4 years ago
- PAIRED in PyTorch 🔥 ☆64 Updated 2 years ago
- ☆32 Updated 4 years ago
- A JAX Implementation of the Twin Delayed DDPG Algorithm ☆35 Updated 5 years ago
- ☆33 Updated last year
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization ☆24 Updated 5 years ago
- Baselines for gymnax 🤖 ☆74 Updated 2 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 Updated last year
- MELD: Meta-Reinforcement Learning from Images via Latent State Models https://arxiv.org/abs/2010.13957 ☆67 Updated 4 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery ☆83 Updated 3 years ago
- Reinforcement Learning with Latent Flow ☆44 Updated 4 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC) ☆103 Updated 2 years ago
- Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020) ☆45 Updated 2 years ago