aadharna / UntouchableThunder

Co-evolution of agents and environments in GVG-AI

☆17

Alternatives and similar repositories for UntouchableThunder

Users who are interested in UntouchableThunder are comparing it to the repositories listed below.

eilab-gt / NovGrid
Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …
☆35 Updated last year
yfletberliac / adversarially-guided-actor-critic
AGAC: Adversarially Guided Actor-Critic
☆48 Updated 3 years ago
alex-petrenko / curious-rl
Curiosity-driven Exploration by Self-supervised Prediction
☆21 Updated 6 years ago
danijar / crafter-baselines
Docker containers of baseline agents for the Crafter environment
☆28 Updated 3 years ago
schmidtdominik / Rainbow
Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …
☆45 Updated 3 years ago
facebookresearch / level-replay
This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …
☆87 Updated 4 years ago
younggyoseo / RE3
RE3: State Entropy Maximization with Random Encoders for Efficient Exploration
☆69 Updated 3 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32 Updated 5 years ago
montrealrobotics / iv_rl
IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
☆39 Updated 8 months ago
ryanrudes / minimal_goexplore
A minimal implementation of Go-Explore without domain knowledge
☆16 Updated 4 years ago
Kaixhin / GUDRL
Generalised UDRL
☆37 Updated 3 years ago
ElisevanderPol / symmetrizer
☆31 Updated 4 years ago
tedmoskovitz / TOP
Implementation of Tactical Optimistic and Pessimistic value estimation
☆25 Updated 2 years ago
machelreid / can-wikipedia-help-offline-rl
Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu
☆105 Updated 3 years ago
kvfrans / powderworld
Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
☆68 Updated 10 months ago
clvrai / create
CREATE Environment for long-horizon physics-puzzle tasks with diverse tools
☆18 Updated 2 years ago
Kaixhin / EC
Episodic Control
☆21 Updated 2 years ago
younggyoseo / trajectory_mcl
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)
☆39 Updated 4 years ago
denisyarats / proto
Proto-RL: Reinforcement Learning with Prototypical Representations
☆82 Updated 3 years ago
instadeepai / fastpbrl
Vectorization techniques for fast population-based training.
☆56 Updated 2 years ago
google-research / pisac
Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)
☆44 Updated 2 years ago
vwxyzjn / gym-microrts-paper
The source code for the gym-microrts paper.
☆42 Updated 2 years ago
WendyShang / flare
Reinforcement Learning with Latent Flow
☆42 Updated 4 years ago
louiskirsch / metagenrl
MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…
☆67 Updated 5 years ago
flowersteam / TeachMyAgent
TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.
☆73 Updated last year
YYCAAA / V-MPO_Lunarlander
Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238
☆47 Updated 4 years ago
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24 Updated 5 years ago
ademiadeniji / irm
Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022)
☆43 Updated last year
google-deepmind / active_ops
☆32 Updated 11 months ago
gkswamy98 / pillbox
Contains implementations of the AdVIL, AdRIL, and DAeQuIL algorithms from the ICML '21 paper Of Moments and Matching.
☆21 Updated 3 years ago