zuoxingdong / VIN_PyTorch_VisdomLinks
PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom.
☆227Updated 8 years ago
Alternatives and similar repositories for VIN_PyTorch_Visdom
Users that are interested in VIN_PyTorch_Visdom are comparing it to the libraries listed below
Sorting:
- Pytorch implementation of Value Iteration Networks (NIPS 2016 best paper)☆319Updated 4 years ago
- ☆159Updated 7 years ago
- Value Iteration Networks☆289Updated 8 years ago
- [ICLR 2018] TensorFlow code for zero-shot visual imitation by self-supervised exploration☆203Updated 7 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆96Updated 7 years ago
- PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch☆114Updated 8 years ago
- Noisy Networks for Exploration☆185Updated 7 years ago
- for learning reinforcement learning using PyTorch.☆64Updated 5 years ago
- Deep Attention Recurrent Q-Network☆115Updated 9 years ago
- TensorFlow implementation of Value Iteration Networks (VIN): Clean, Simple and Modular☆52Updated 8 years ago
- Implementation of "Action-Conditional Video Prediction using Deep Networks in Atari Games"☆114Updated 9 years ago
- PyTorch implementation of the Value Iteration Networks (VIN) (NIPS '16 best paper)☆80Updated 8 years ago
- Reinforcement learning models in ViZDoom environment☆133Updated 3 years ago
- NIPS 2017 Value Prediction Network☆166Updated 7 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆152Updated 7 years ago
- pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction☆79Updated 6 years ago
- This's an implementation of deepmind Visual Interaction Networks paper using pytorch☆166Updated 7 years ago
- Code for the paper "Learning to Act by Predicting the Future", Alexey Dosovitskiy and Vladlen Koltun, ICLR 2017☆150Updated 9 months ago
- [NIPS 2017] InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations☆180Updated 6 months ago
- A parallel version of Trust Region Policy Optimization☆65Updated 8 years ago
- Train an RL agent to play multiple Atari games at once☆69Updated 8 years ago
- Implement A3C for Mujoco gym envs☆72Updated 7 years ago
- "Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow☆192Updated 6 years ago
- ☆101Updated 8 years ago
- A list of deep neural network architectures for reinforcement learning tasks.☆169Updated 8 years ago
- third person imitation learning. Archival only.☆76Updated 5 years ago
- Neural Turing Machine (NTM) & Differentiable Neural Computer (DNC) with pytorch & visdom☆277Updated 7 years ago
- Easy TensorFlow logging for quick prototypes☆110Updated 3 years ago
- Proximal Policy Optimization in PyTorch☆39Updated 7 years ago
- Advantage async actor-critic Algorithms (A3C) and Progressive Neural Network implemented by tensorflow.☆120Updated 8 years ago