There are few caveats when you want to use a Recurrent Neural Network (RNN) policy with Policy Gradient Algorithms. This repository explains them and provide a solution for them. Please see the blog for more details.
☆18Aug 16, 2017Updated 8 years ago
Alternatives and similar repositories for pg_rnn
Users that are interested in pg_rnn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Apr 12, 2017Updated 9 years ago
- Neural machine translation with Recurrent Deterministic Policy Gradient☆10Aug 18, 2016Updated 9 years ago
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)☆10Jun 10, 2017Updated 8 years ago
- Recurrent Network-based Deterministic Policy Gradient for Solving Bipedal Walking Challenge on Rugged Terrains☆12Oct 16, 2017Updated 8 years ago
- ☆10Nov 12, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆13Jan 23, 2021Updated 5 years ago
- A deep learning package for computer vision algorithms built on top of TensorFlow☆11Sep 12, 2018Updated 7 years ago
- PyTorch implementation of ICML 2017 paper, SplitNet: Learning to Semantically Split Deep Networks for Parameter Reduction and Model Paral…☆17Oct 24, 2017Updated 8 years ago
- COMS30017 Computational Neuroscience☆11Jan 7, 2022Updated 4 years ago
- Human activity recognition using hidden Markov model.☆10Jan 7, 2018Updated 8 years ago
- NeurIPS 2020☆17Jun 18, 2021Updated 4 years ago
- Project Page for Generative Gaussian Splatting for Efficient 3D Content Creation☆16Feb 1, 2024Updated 2 years ago
- A Towers of Hanoi environment in OpenAI Gym Style☆14Jun 6, 2019Updated 6 years ago
- Implementations of vanilla autoencoder, VAE, and GAN in Tensorflow☆18Jul 12, 2017Updated 8 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]☆24Nov 21, 2024Updated last year
- Code for EMNLP 2022 Paper: On the Calibration of Massively Multilingual Language Models☆15Jun 12, 2023Updated 2 years ago
- Computational Neuroscience 3rd year CS course at the University of Bristol☆12Jul 19, 2022Updated 3 years ago
- 西班牙短文本匹配比赛,初赛8/1027,复赛5/1027☆19Aug 1, 2018Updated 7 years ago
- tensorflow implementation of cnn localization project by CSAIL@MIT (CVPR'16)☆18Sep 18, 2017Updated 8 years ago
- ☆18Jun 3, 2024Updated last year
- Repository for paper: "SnAKe: Bayesian Optimization with Pathwise Exploration".☆17Feb 9, 2024Updated 2 years ago
- A julia package for bayesian optimization of black box functions.☆23Dec 4, 2020Updated 5 years ago
- A system for computational category theory and applications☆40Jun 27, 2016Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Quadrotor simulator mainly purposed to train neural network to control quadrotor flight via deep q learning algorithm☆27Aug 5, 2022Updated 3 years ago
- Master Thesis. Code written in python. (Keras with Tensorflow backend)☆23Jun 16, 2020Updated 5 years ago
- An implementation of the A3C deep reinforcement learning method using a LSTM layer. Created with Tensorflow.☆29Oct 18, 2017Updated 8 years ago
- Implicit Deep Adaptive Design (iDAD): Policy-Based Experimental Design without Likelihoods☆23Dec 30, 2021Updated 4 years ago
- Recurrent Deterministic Policy Gradient actor-critic based Reinforcement Learning algorithm in Action☆36Feb 27, 2025Updated last year
- A simple, continuous-control environment for OpenAI Gym☆23Jan 1, 2023Updated 3 years ago
- A procedural geometry generator for rendering pipline.☆10Feb 23, 2019Updated 7 years ago
- Deprecated repository for "Deep Learning with Topological Signatures"☆37Mar 6, 2020Updated 6 years ago
- ☆12May 20, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆35Nov 17, 2021Updated 4 years ago
- Contrastive Learning for Image Captioning☆51Feb 22, 2018Updated 8 years ago
- PCB design for the initial prototype of OBC hardware, to interface with LaunchPad.☆10Sep 19, 2018Updated 7 years ago
- ☆14Jun 25, 2022Updated 3 years ago
- akid is a python package written for doing research in Neural Network.☆14Mar 24, 2023Updated 3 years ago
- Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)☆35Mar 6, 2021Updated 5 years ago
- Prototype web interface that enables remote teleoperation of the Stretch RE1 mobile manipulator from Hello Robot Inc.☆12Dec 14, 2023Updated 2 years ago