There are a few caveats when you want to use a Recurrent Neural Network (RNN) policy with policy gradient algorithms. This repository explains them and provides a solution. Please see the blog for more details.
☆18 · Aug 16, 2017 · Updated 8 years ago
Alternatives and similar repositories for pg_rnn
Users that are interested in pg_rnn are comparing it to the libraries listed below.
- ☆11 · Jan 3, 2023 · Updated 3 years ago
- Julia package for transfer operator spectral methods ☆11 · Aug 14, 2024 · Updated last year
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455) ☆10 · Jun 10, 2017 · Updated 9 years ago
- A binding of the physics engine Chipmunk for Julia ☆10 · Sep 26, 2015 · Updated 10 years ago
- Recurrent Network-based Deterministic Policy Gradient for Solving Bipedal Walking Challenge on Rugged Terrains ☆12 · Oct 16, 2017 · Updated 8 years ago
- Here we will to store papers from bayesgroup.ru ☆11 · Dec 15, 2016 · Updated 9 years ago
- ☆10 · Nov 12, 2020 · Updated 5 years ago
- CLARA: Confidence of Labels and Raters ☆11 · Jun 3, 2023 · Updated 3 years ago
- ☆13 · Jan 23, 2021 · Updated 5 years ago
- A deep learning package for computer vision algorithms built on top of TensorFlow ☆11 · Sep 12, 2018 · Updated 7 years ago
- The Matlab Code for the AISTATS 2015 paper "Learning Deep Sigmoid Belief Network with Data Augmentation" ☆13 · Sep 20, 2015 · Updated 10 years ago
- Human activity recognition using hidden Markov model. ☆10 · Jan 7, 2018 · Updated 8 years ago
- MPC trajectory tracking + reachability-based collision avoidance for pairwise vehicle interactions ☆19 · Jul 10, 2020 · Updated 5 years ago
- NeurIPS 2020 ☆17 · Jun 18, 2021 · Updated 5 years ago
- ☆21 · Oct 23, 2023 · Updated 2 years ago
- OpenAI Gym environment for DART robotics simulator. ☆22 · Apr 17, 2018 · Updated 8 years ago
- Code for Accelerated Linearized Laplace Approximation for Bayesian Deep Learning (ELLA, NeurIPS 22') ☆17 · Oct 12, 2022 · Updated 3 years ago
- Project Euler ☆10 · Dec 8, 2018 · Updated 7 years ago
- ☆11 · Jul 5, 2020 · Updated 5 years ago
- SocialCompliantRobot ☆16 · Oct 10, 2023 · Updated 2 years ago
- training BART from scratch ☆12 · Dec 31, 2021 · Updated 4 years ago
- Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24] ☆24 · Nov 21, 2024 · Updated last year
- Creating DRL infrastructure for Dynamic Beta with Zipline and Keras ☆14 · Dec 8, 2022 · Updated 3 years ago
- Computational Neuroscience 3rd year CS course at the University of Bristol ☆13 · Jul 19, 2022 · Updated 3 years ago
- Coqtail is a library of mathematical theorems and tools proved inside the Coq proof assistant. Results range mostly from arithmetic to re… ☆16 · Apr 13, 2026 · Updated 2 months ago
- tensorflow implementation of cnn localization project by CSAIL@MIT (CVPR'16) ☆18 · Sep 18, 2017 · Updated 8 years ago
- Use basic deep reinforcement learning to solve Doom health gathering environment ☆27 · Jun 21, 2018 · Updated 7 years ago
- Rosetta FunFolDes – a general framework for the computational design of functional proteins. ☆20 · Apr 12, 2019 · Updated 7 years ago
- ☆14 · Sep 11, 2022 · Updated 3 years ago
- Simulate differential equations using TensorFlow ☆19 · May 21, 2017 · Updated 9 years ago
- Repository for paper: "SnAKe: Bayesian Optimization with Pathwise Exploration". ☆17 · Feb 9, 2024 · Updated 2 years ago
- Black Box Variational Inference for Bayesian logistic regression ☆18 · Apr 1, 2017 · Updated 9 years ago
- A system for computational category theory and applications ☆41 · Jun 27, 2016 · Updated 9 years ago
- Quadrotor simulator mainly purposed to train neural network to control quadrotor flight via deep q learning algorithm ☆27 · Aug 5, 2022 · Updated 3 years ago
- Magnify motion in videos with Riesz pyramids. ☆22 · Mar 27, 2015 · Updated 11 years ago
- Workflow to download, process, and explore microbial RNA-seq data from NCBI SRA ☆17 · Feb 29, 2024 · Updated 2 years ago
- Tensorflow implement of paper: Optimization of image description metrics using policy gradient methods ☆29 · Jul 31, 2018 · Updated 7 years ago
- An implementation of the A3C deep reinforcement learning method using a LSTM layer. Created with Tensorflow. ☆29 · Oct 18, 2017 · Updated 8 years ago
- Implicit Deep Adaptive Design (iDAD): Policy-Based Experimental Design without Likelihoods ☆25 · Dec 30, 2021 · Updated 4 years ago