mrahtz / easy-tf-log
Easy TensorFlow logging for quick prototypes
☆110Updated 3 years ago
Alternatives and similar repositories for easy-tf-log:
Users that are interested in easy-tf-log are comparing it to the libraries listed below
- Accompanying code for "Deep Reinforcement Learning that Matters"☆151Updated 7 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆266Updated 5 years ago
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.☆374Updated 2 years ago
- ☆117Updated 4 years ago
- NIPS 2017 Value Prediction Network☆165Updated 7 years ago
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u…☆51Updated 5 years ago
- Velocity in deep-learning research☆276Updated 2 years ago
- Full World Models Implementation in Chainer☆165Updated 6 years ago
- A reinforcement learning framework☆154Updated 6 years ago
- Implementations of deep RL papers and random experimentation☆177Updated 6 years ago
- A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th…☆201Updated 4 years ago
- A parallel version of Trust Region Policy Optimization☆65Updated 8 years ago
- for learning reinforcement learning using PyTorch.☆64Updated 5 years ago
- pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction☆79Updated 6 years ago
- Codes of our team for the OpenAI Retro Contest of reinforcement learning☆99Updated 6 years ago
- Reinforcement learning models in ViZDoom environment☆131Updated 3 years ago
- Code for the paper "Evolved Policy Gradients"☆250Updated 6 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆56Updated 7 years ago
- ☆159Updated 7 years ago
- Noisy Networks for Exploration☆186Updated 7 years ago
- PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom.☆227Updated 7 years ago
- Publicly releasable baselines for the Retro contest☆127Updated 6 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆96Updated 6 years ago
- Implementation of PPO in Pytorch☆41Updated 7 years ago
- PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch☆114Updated 7 years ago
- Benchmark and build RL architectures that can do multitask and transfer learning.☆142Updated 2 years ago
- Deep RL Algorithms implemented for UC Berkeley's CS 294-112: Deep Reinforcement Learning☆139Updated 7 years ago
- "Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow☆192Updated 6 years ago
- TensorFlow implementation of Value Iteration Networks (VIN): Clean, Simple and Modular☆52Updated 7 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆197Updated 6 years ago