chrischute / pbt
Population-Based Training in Python
☆18 Updated 5 years ago
Related projects:
- ☆42 Updated 5 years ago
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future ☆51 Updated 5 years ago
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w… ☆31 Updated 6 years ago
- On the pitfalls of measuring emergent communication ☆33 Updated 5 years ago
- ☆80 Updated 11 months ago
- Boilerplate code for Torch-based ML projects ☆10 Updated 3 years ago
- Optimized Differentiable Neural Computer In Chainer ☆23 Updated 6 years ago
- ☆31 Updated 5 years ago
- A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc. ☆10 Updated 4 years ago
- PyTorch implementation of the NIPS'17 paper Training Deep Networks without Learning Rates Through Coin Betting. ☆38 Updated 6 years ago
- Go through the list of accepted papers for ICLR in terminal and add them to your reading list. ☆13 Updated 3 years ago
- PyTorch implementation of Memory Augmented Self-Play ☆50 Updated 3 years ago
- A collection of code investigating the use of information theory for abstractions in RL ☆15 Updated 5 years ago
- ☆45 Updated 4 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning. ☆50 Updated 5 years ago
- Implementation of Advantage Actor Critic (A2C) and Proximal Policy Optimization Algorithm (PPO) using the advantages of TensorFlow 2.x. ☆9 Updated 4 years ago
- Reinforcement Learning Assembly ☆92 Updated 3 years ago
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020) ☆16 Updated 4 years ago
- Code accompanying the OptionGAN paper. ☆43 Updated 6 years ago
- Code for human intervention reinforcement learning ☆33 Updated 6 years ago
- Ranking Policy Gradient ☆23 Updated 4 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC) ☆101 Updated last year
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals". ☆61 Updated last year
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees ☆93 Updated 5 years ago
- Adapting the AlphaZero algorithm to remove the need for execution traces to train NPI. ☆77 Updated 11 months ago
- Minimal and Clean Reinforcement Learning Examples in PyTorch ☆41 Updated 5 years ago
- Fork of rl-baseline-zoo ☆21 Updated 4 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning ☆93 Updated 4 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch ☆49 Updated 6 years ago
- Reward Learning by Simulating the Past ☆43 Updated 5 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies ☆15 Updated 5 years ago