chrischute / pbt
Population-Based Training in Python
☆18Updated 6 years ago
Related projects
Alternatives and complementary repositories for pbt
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…☆31Updated 6 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning☆20Updated 7 years ago
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future☆50Updated 5 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 5 years ago
- On the pitfalls of measuring emergent communication☆34Updated 5 years ago
- Code accompanying the OptionGAN paper.☆43Updated 6 years ago
- Ranking Policy Gradient☆23Updated 4 years ago
- Implementation of Advantage Actor Critic (A2C) and Proximal Policy Optimization (PPO) using the advantages of TensorFlow 2.x.☆9Updated 4 years ago
- Simple reinforcement learning example for Minecraft☆9Updated 6 years ago
- ☆42Updated 5 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Updated 7 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Gym wrapper for ViZDoom environments☆12Updated 5 years ago
- ☆80Updated last year
- Adapting the AlphaZero algorithm to remove the need for execution traces to train NPI.☆78Updated last year
- ☆35Updated 6 years ago
- Implementation of Neural Episodic Control in TensorFlow☆26Updated 5 years ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)☆16Updated 4 years ago
- Solving reinforcement learning tasks which require language and vision☆32Updated last year
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies☆15Updated 5 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 3 years ago
- Dead-ends and Secure Exploration in Reinforcement Learning☆11Updated 5 years ago
- Reward Learning by Simulating the Past☆43Updated 5 years ago
- ☆32Updated 6 years ago
- Contextual Bandits Action Elimination DQN☆19Updated 6 years ago
- A simple Gridworld environment for OpenAI Gym☆24Updated 6 years ago