PyTorch Implementation of REINFORCE for both discrete & continuous control
☆266 · Apr 16, 2017 · Updated 9 years ago
Alternatives and similar repositories for pytorch-REINFORCE
Users that are interested in pytorch-REINFORCE are comparing it to the libraries listed below.
- ☆44 · Sep 6, 2017 · Updated 8 years ago
- Pytorch implementation of "Forward Thinking: Building and Training Neural Networks One Layer at a Time" ☆65 · Jun 14, 2017 · Updated 8 years ago
- Pytorch implementation of DeepMind's differentiable neural computer paper. ☆92 · Dec 4, 2017 · Updated 8 years ago
- ☆11 · Aug 16, 2015 · Updated 10 years ago
- Deep Reinforcement Learning with pytorch & visdom ☆805 · Jul 16, 2020 · Updated 5 years ago
- ☆15 · Sep 5, 2016 · Updated 9 years ago
- Implement A3C for Mujoco gym envs ☆73 · Nov 2, 2017 · Updated 8 years ago
- Implementation of the paper [Using Fast Weights to Attend to the Recent Past](https://arxiv.org/abs/1610.06258) ☆174 · Nov 3, 2016 · Updated 9 years ago
- Noisy Networks for Exploration ☆187 · Jan 28, 2018 · Updated 8 years ago
- ☆38 · Mar 6, 2017 · Updated 9 years ago
- An implementation of Color2Gray with convolutional neural networks ☆11 · Dec 23, 2015 · Updated 10 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor… ☆3,901 · May 29, 2022 · Updated 4 years ago
- DNI (Decoupled Neural Interfaces using Synthetic Gradients) implementation with Torch ☆30 · Aug 30, 2016 · Updated 9 years ago
- Deep Q-Learning Network in pytorch (not actively maintained) ☆430 · Nov 1, 2017 · Updated 8 years ago
- PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning". ☆1,326 · Sep 25, 2019 · Updated 6 years ago
- Pytorch implementation of Value Iteration Networks (NIPS 2016 best paper) ☆318 · Oct 2, 2020 · Updated 5 years ago
- PyTorch implementation of the Value Iteration Networks (VIN) (NIPS '16 best paper) ☆80 · Mar 13, 2017 · Updated 9 years ago
- pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction ☆80 · Jan 5, 2019 · Updated 7 years ago
- auto-tuning momentum SGD optimizer ☆287 · Mar 24, 2019 · Updated 7 years ago
- ☆58 · Aug 28, 2018 · Updated 7 years ago
- Torch implementation of the Deep Network for Global Optimization (DNGO) ☆51 · Jul 26, 2016 · Updated 9 years ago
- Policy gradient reinforcement learning algorithm with importance sampling ☆33 · Oct 6, 2017 · Updated 8 years ago
- Implement Decoupled Neural Interfaces using Synthetic Gradients in Pytorch ☆119 · Oct 19, 2017 · Updated 8 years ago
- RWA in pytorch ☆14 · May 7, 2017 · Updated 9 years ago
- An implementation of the deep convolutional generative adversarial network, combined with a variational autoencoder ☆108 · Mar 18, 2017 · Updated 9 years ago
- Neural Turing Machine (NTM) & Differentiable Neural Computer (DNC) with pytorch & visdom ☆278 · Feb 20, 2018 · Updated 8 years ago
- Evolution Strategies in PyTorch ☆352 · Sep 11, 2017 · Updated 8 years ago
- PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom. ☆224 · Mar 29, 2017 · Updated 9 years ago
- Modularized Implementation of Deep RL Algorithms in PyTorch ☆3,425 · Apr 16, 2024 · Updated 2 years ago
- Optimizing control variates for black-box gradient estimation ☆163 · Jul 26, 2019 · Updated 6 years ago
- PyTorch implementation of Trust Region Policy Optimization ☆448 · Sep 13, 2018 · Updated 7 years ago
- PyTorch implementation of Global Vectors for Word Representation. ☆90 · Mar 6, 2018 · Updated 8 years ago
- Code for Attentive Recurrent Comparators ☆56 · Mar 3, 2017 · Updated 9 years ago
- Hadamard Product for Low-rank Bilinear Pooling ☆71 · Nov 6, 2017 · Updated 8 years ago
- ☆31 · May 8, 2017 · Updated 9 years ago
- PyTorch implementation of Advantage async actor-critic Algorithms (A3C) ☆113 · Apr 3, 2017 · Updated 9 years ago
- Generate captions for an image using convolutional and recurrent networks ☆12 · Feb 25, 2016 · Updated 10 years ago
- VLFeat (partial) FFI wrapper for Torch7 ☆12 · Mar 23, 2016 · Updated 10 years ago
- Batch-Normalized LSTM (Recurrent Batch Normalization) implementation in Torch. ☆90 · May 22, 2016 · Updated 10 years ago