openai / robosumo
Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"
☆305Updated last year
Alternatives and similar repositories for robosumo:
Users that are interested in robosumo are comparing it to the libraries listed below
- This repo is intended as an extension for OpenAI Gym for auxiliary tasks (multitask learning, transfer learning, inverse reinforcement le…☆214Updated 5 years ago
- Code for the paper "Evolved Policy Gradients"☆249Updated 6 years ago
- ICML 2018 Self-Imitation Learning☆276Updated 4 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆200Updated 6 years ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me…☆233Updated 2 years ago
- Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability☆198Updated 4 years ago
- Code for the paper "Quantifying Transfer in Reinforcement Learning"☆396Updated last year
- Actor-critic with experience replay☆251Updated 2 years ago
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"☆189Updated 5 years ago
- Publicly releasable baselines for the Retro contest☆128Updated 6 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym☆360Updated 4 years ago
- Building Agents with Imagination: pytorch step-by-step implementation☆206Updated 5 years ago
- Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"☆343Updated 6 years ago
- NIPS 2017 Value Prediction Network☆166Updated 6 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆180Updated 6 years ago
- ☆118Updated 4 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆416Updated last year
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆267Updated 5 years ago
- An implementation of FeUdal Networks for Hierarchical Reinforcement Learning as published : https://arxiv.org/abs/1703.01161☆178Updated 7 years ago
- An implement of DQfD(Deep Q-learning from Demonstrations) raised by DeepMind:Learning from Demonstrations for Real World Reinforcement Le…☆133Updated 7 years ago
- Implementation of the Option-Critic Architecture on the Atari (ALE) environment☆171Updated 7 years ago
- Velocity in deep-learning research☆276Updated 2 years ago
- A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th…☆200Updated 4 years ago
- Learning to Adapt in Dynamic, Real-World Environment through Meta-Reinforcement Learning☆208Updated last year
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆367Updated 3 years ago
- Basic versions of agents from Spinning Up in Deep RL written in PyTorch☆197Updated 3 years ago
- A PyTorch implementation of Rainbow DQN agent☆166Updated 6 years ago
- Implementations of deep RL papers and random experimentation☆177Updated 6 years ago
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u…☆51Updated 4 years ago
- A continuous action space version of A3C LSTM in pytorch plus A3G design☆258Updated 2 months ago