There are a few caveats when you want to use a Recurrent Neural Network (RNN) policy with policy gradient algorithms. This repository explains them and provides a solution for them. Please see the blog for more details.
☆18 · Aug 16, 2017 · Updated 8 years ago
Alternatives and similar repositories for pg_rnn
Users that are interested in pg_rnn are comparing it to the libraries listed below.
- ☆11 · Jan 3, 2023 · Updated 3 years ago
- Julia package for transfer operator spectral methods ☆11 · Aug 14, 2024 · Updated last year
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455) ☆10 · Jun 10, 2017 · Updated 8 years ago
- path planning with RRTs in Python! ☆14 · Feb 28, 2025 · Updated last year
- Recurrent Network-based Deterministic Policy Gradient for Solving Bipedal Walking Challenge on Rugged Terrains ☆12 · Oct 16, 2017 · Updated 8 years ago
- Here we will store papers from bayesgroup.ru ☆11 · Dec 15, 2016 · Updated 9 years ago
- ☆10 · Nov 12, 2020 · Updated 5 years ago
- ☆13 · Jan 23, 2021 · Updated 5 years ago
- A deep learning package for computer vision algorithms built on top of TensorFlow ☆11 · Sep 12, 2018 · Updated 7 years ago
- The Matlab Code for the AISTATS 2015 paper "Learning Deep Sigmoid Belief Network with Data Augmentation" ☆13 · Sep 20, 2015 · Updated 10 years ago
- COMS30017 Computational Neuroscience ☆11 · Jan 7, 2022 · Updated 4 years ago
- Human activity recognition using hidden Markov model. ☆10 · Jan 7, 2018 · Updated 8 years ago
- A Towers of Hanoi environment in OpenAI Gym Style ☆14 · Jun 6, 2019 · Updated 6 years ago
- Code for Accelerated Linearized Laplace Approximation for Bayesian Deep Learning (ELLA, NeurIPS 22') ☆17 · Oct 12, 2022 · Updated 3 years ago
- SocialCompliantRobot ☆16 · Oct 10, 2023 · Updated 2 years ago
- Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24] ☆24 · Nov 21, 2024 · Updated last year
- Coqtail is a library of mathematical theorems and tools proved inside the Coq proof assistant. Results range mostly from arithmetic to re… ☆16 · Oct 20, 2025 · Updated 5 months ago
- Use basic deep reinforcement learning to solve Doom health gathering environment ☆27 · Jun 21, 2018 · Updated 7 years ago
- Rosetta FunFolDes – a general framework for the computational design of functional proteins. ☆21 · Apr 12, 2019 · Updated 6 years ago
- ☆14 · Sep 11, 2022 · Updated 3 years ago
- Andy Sloane's rotating donut in Julia ☆31 · Apr 22, 2021 · Updated 4 years ago
- ☆39 · Jun 2, 2023 · Updated 2 years ago
- A system for computational category theory and applications ☆40 · Jun 27, 2016 · Updated 9 years ago
- Quadrotor simulator mainly purposed to train neural network to control quadrotor flight via deep q learning algorithm ☆27 · Aug 5, 2022 · Updated 3 years ago
- Magnify motion in videos with Riesz pyramids. ☆22 · Mar 27, 2015 · Updated 11 years ago
- An implementation of the A3C deep reinforcement learning method using a LSTM layer. Created with Tensorflow. ☆29 · Oct 18, 2017 · Updated 8 years ago
- Implicit Deep Adaptive Design (iDAD): Policy-Based Experimental Design without Likelihoods ☆22 · Dec 30, 2021 · Updated 4 years ago
- ☆15 · Aug 14, 2025 · Updated 7 months ago
- Discrete differential geometry in Python. Derived from CMU and Caltech Codebases and Published papers ☆32 · Aug 19, 2023 · Updated 2 years ago
- Formalizing geometry in Lean : IGL/UniHigh Summer 2020 research project ☆31 · Jan 17, 2022 · Updated 4 years ago
- Recurrent Deterministic Policy Gradient actor-critic based Reinforcement Learning algorithm in Action ☆37 · Feb 27, 2025 · Updated last year
- A multiprotocol and multiplatform quantum random number generation framework ☆27 · Nov 27, 2021 · Updated 4 years ago
- An autonomous exploration library ☆69 · Jul 16, 2020 · Updated 5 years ago
- Bayesian low-rank adaptation for large language models ☆28 · May 4, 2024 · Updated last year
- ☆12 · May 20, 2025 · Updated 10 months ago
- PCB design for the initial prototype of OBC hardware, to interface with LaunchPad. ☆10 · Sep 19, 2018 · Updated 7 years ago
- Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875) ☆35 · Mar 6, 2021 · Updated 5 years ago
- Prototype web interface that enables remote teleoperation of the Stretch RE1 mobile manipulator from Hello Robot Inc. ☆12 · Dec 14, 2023 · Updated 2 years ago
- Spacecraft simulation. ☆13 · Feb 26, 2026 · Updated last month