There are few caveats when you want to use a Recurrent Neural Network (RNN) policy with Policy Gradient Algorithms. This repository explains them and provide a solution for them. Please see the blog for more details.
☆18Aug 16, 2017Updated 8 years ago
Alternatives and similar repositories for pg_rnn
Users that are interested in pg_rnn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Apr 12, 2017Updated 9 years ago
- Learning to optimize (L2O) package that provides basic functionalities to help fit proxy models for optimization.☆15Apr 1, 2025Updated last year
- ☆11Jan 3, 2023Updated 3 years ago
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)☆10Jun 10, 2017Updated 8 years ago
- Neural style in Julia☆11Feb 8, 2020Updated 6 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Recurrent Network-based Deterministic Policy Gradient for Solving Bipedal Walking Challenge on Rugged Terrains☆12Oct 16, 2017Updated 8 years ago
- Here we will to store papers from bayesgroup.ru☆11Dec 15, 2016Updated 9 years ago
- ☆10Nov 12, 2020Updated 5 years ago
- ☆13Jan 23, 2021Updated 5 years ago
- A cross-platform C++11 implementation of the CMM language interpreter☆11Jul 18, 2025Updated 9 months ago
- A deep learning package for computer vision algorithms built on top of TensorFlow☆11Sep 12, 2018Updated 7 years ago
- ☆13Jun 10, 2025Updated 10 months ago
- ☆17May 31, 2023Updated 2 years ago
- The Matlab Code for the AISTATS 2015 paper "Learning Deep Sigmoid Belief Network with Data Augmentation"☆13Sep 20, 2015Updated 10 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A PyTorch implementation of paper "Visualizing and Understanding Recurrent Networks"☆10Mar 16, 2018Updated 8 years ago
- ☆19Apr 20, 2023Updated 2 years ago
- OpenAI Gym environment for DART robotics simulator.☆22Apr 17, 2018Updated 8 years ago
- A Towers of Hanoi environment in OpenAI Gym Style☆14Jun 6, 2019Updated 6 years ago
- TensorFlow implementation of the paper "Learning to learn by gradient descent by gradient descent ( https://arxiv.org/abs/1606.04474 )"☆84May 29, 2017Updated 8 years ago
- Quantum randomness source using the ANU hardware QRNG☆17Feb 8, 2024Updated 2 years ago
- Code for Accelerated Linearized Laplace Approximation for Bayesian Deep Learning (ELLA, NeurIPS 22')☆17Oct 12, 2022Updated 3 years ago
- Project Euler☆10Dec 8, 2018Updated 7 years ago
- SocialCompliantRobot☆16Oct 10, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Creating DRL infrastructure for Dynamic Beta with Zipline and Keras☆14Dec 8, 2022Updated 3 years ago
- Code for EMNLP 2022 Paper: On the Calibration of Massively Multilingual Language Models☆15Jun 12, 2023Updated 2 years ago
- Computational Neuroscience 3rd year CS course at the University of Bristol☆12Jul 19, 2022Updated 3 years ago
- Coqtail is a library of mathematical theorems and tools proved inside the Coq proof assistant. Results range mostly from arithmetic to re…☆16Mar 25, 2026Updated 3 weeks ago
- ☆18Jun 3, 2024Updated last year
- TensorFlow implementation of Faster RCNN for Object Detection☆16Apr 10, 2018Updated 8 years ago
- ☆20Feb 20, 2017Updated 9 years ago
- A julia package for bayesian optimization of black box functions.☆23Dec 4, 2020Updated 5 years ago
- breadth-first search in parallel☆18May 30, 2013Updated 12 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Black Box Variational Inference for Bayesian logistic regression☆18Apr 1, 2017Updated 9 years ago
- Quadrotor simulator mainly purposed to train neural network to control quadrotor flight via deep q learning algorithm☆27Aug 5, 2022Updated 3 years ago
- Master Thesis. Code written in python. (Keras with Tensorflow backend)☆23Jun 16, 2020Updated 5 years ago
- A minimal implementation of neural network for MNIST experiment. Used as an exercise to help understanding Backpropagation by implementin…☆73Jan 8, 2024Updated 2 years ago
- An implementation of the A3C deep reinforcement learning method using a LSTM layer. Created with Tensorflow.☆29Oct 18, 2017Updated 8 years ago
- Implicit Deep Adaptive Design (iDAD): Policy-Based Experimental Design without Likelihoods☆23Dec 30, 2021Updated 4 years ago
- A simple, continuous-control environment for OpenAI Gym☆23Jan 1, 2023Updated 3 years ago