There are few caveats when you want to use a Recurrent Neural Network (RNN) policy with Policy Gradient Algorithms. This repository explains them and provide a solution for them. Please see the blog for more details.
☆18Aug 16, 2017Updated 8 years ago
Alternatives and similar repositories for pg_rnn
Users that are interested in pg_rnn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Neural machine translation with Recurrent Deterministic Policy Gradient☆10Aug 18, 2016Updated 9 years ago
- ☆11Jan 3, 2023Updated 3 years ago
- Julia package for transfer operator spectral methods☆11Aug 14, 2024Updated last year
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)☆10Jun 10, 2017Updated 8 years ago
- path planning with RRTs in Python!☆14Feb 28, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Recurrent Network-based Deterministic Policy Gradient for Solving Bipedal Walking Challenge on Rugged Terrains☆12Oct 16, 2017Updated 8 years ago
- Here we will to store papers from bayesgroup.ru☆11Dec 15, 2016Updated 9 years ago
- ☆10Nov 12, 2020Updated 5 years ago
- ☆13Jan 23, 2021Updated 5 years ago
- The Matlab Code for the AISTATS 2015 paper "Learning Deep Sigmoid Belief Network with Data Augmentation"☆13Sep 20, 2015Updated 10 years ago
- PyTorch implementation of ICML 2017 paper, SplitNet: Learning to Semantically Split Deep Networks for Parameter Reduction and Model Paral…☆17Oct 24, 2017Updated 8 years ago
- COMS30017 Computational Neuroscience☆11Jan 7, 2022Updated 4 years ago
- MPC trajectory tracking + reachability-based collision avoidance for pairwise vehicle interactions☆19Jul 10, 2020Updated 5 years ago
- ☆19Apr 20, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- OpenAI Gym environment for DART robotics simulator.☆22Apr 17, 2018Updated 8 years ago
- A Towers of Hanoi environment in OpenAI Gym Style☆14Jun 6, 2019Updated 6 years ago
- TensorFlow implementation of the paper "Learning to learn by gradient descent by gradient descent ( https://arxiv.org/abs/1606.04474 )"☆84May 29, 2017Updated 9 years ago
- Typescript dictionary for Node.JS objects providing associative array support.☆13Dec 8, 2022Updated 3 years ago
- Monte Carlo password checking☆11Aug 14, 2017Updated 8 years ago
- SocialCompliantRobot☆16Oct 10, 2023Updated 2 years ago
- smarc ros2-humble main repository☆19May 21, 2026Updated last week
- training BART from scratch☆12Dec 31, 2021Updated 4 years ago
- Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]☆24Nov 21, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Code for EMNLP 2022 Paper: On the Calibration of Massively Multilingual Language Models☆15Jun 12, 2023Updated 2 years ago
- Computational Neuroscience 3rd year CS course at the University of Bristol☆13Jul 19, 2022Updated 3 years ago
- Coqtail is a library of mathematical theorems and tools proved inside the Coq proof assistant. Results range mostly from arithmetic to re…☆16Apr 13, 2026Updated last month
- Improved Training of Wasserstein GANs for Neural Machine Translation☆11Dec 11, 2017Updated 8 years ago
- Python version of the OMEN password cracker☆17Dec 17, 2024Updated last year
- Use basic deep reinforcement learning to solve Doom health gathering environment☆27Jun 21, 2018Updated 7 years ago
- Rosetta FunFolDes – a general framework for the computational design of functional proteins.☆21Apr 12, 2019Updated 7 years ago
- 下载B站的视频,保存为mp4,再转换为mp3格式(听歌小助手)☆13Nov 19, 2020Updated 5 years ago
- TensorFlow implementation of Faster RCNN for Object Detection☆16Apr 10, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- FineTune HuggingFace's T5 implementation on NMT☆11Aug 25, 2020Updated 5 years ago
- Simulate differential equations using TensorFlow☆19May 21, 2017Updated 9 years ago
- A julia package for bayesian optimization of black box functions.☆23Dec 4, 2020Updated 5 years ago
- A framework for password-strength evaluation☆14Sep 26, 2020Updated 5 years ago
- ☆39Jun 2, 2023Updated 2 years ago
- Quadrotor simulator mainly purposed to train neural network to control quadrotor flight via deep q learning algorithm☆27Aug 5, 2022Updated 3 years ago
- chrome浏览器插件,密码生成器☆13Aug 1, 2019Updated 6 years ago