Policy gradient reinforcement learning algorithm with importance sampling
☆33Oct 6, 2017Updated 8 years ago
Alternatives and similar repositories for policy-gradient-importance-sampling
Users that are interested in policy-gradient-importance-sampling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Sep 1, 2017Updated 8 years ago
- ☆11Jan 20, 2016Updated 10 years ago
- Reinforcement Leanring for Tetris☆19Oct 24, 2016Updated 9 years ago
- This is a TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DDPG)☆11Sep 14, 2017Updated 8 years ago
- Github Repo for CARL: Cautious Adaptation for RL in Safety Critical Settings☆14Nov 22, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Improved Training of Wasserstein GANs for Neural Machine Translation☆11Dec 11, 2017Updated 8 years ago
- ☆20Apr 27, 2016Updated 9 years ago
- Code for Policy Bifurcation in Safe Reinforcement Learning☆10Jul 4, 2025Updated 8 months ago
- NeurIPS'23: Energy Discrepancies: A Score-Independent Loss for Energy-Based Models☆17Oct 22, 2024Updated last year
- Low-rank Highway Networks☆13Mar 11, 2016Updated 10 years ago
- PyTorch Implementation of REINFORCE for both discrete & continuous control☆267Apr 16, 2017Updated 8 years ago
- Relative gradient optimization of the Jacobian term in unsupervised deep learning, NeurIPS 2020☆21Apr 27, 2021Updated 4 years ago
- 🤖 Implementation of Self Normalizing Networks (SNN) in PyTorch.☆13Jun 19, 2017Updated 8 years ago
- Dynamic Time-Aware Attention to Speaker Roles and Contexts for Spoken Language Understanding☆14Sep 28, 2017Updated 8 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆20Jan 15, 2024Updated 2 years ago
- Some paper reading notes☆16Oct 13, 2018Updated 7 years ago
- Tensor Switching Networks☆12Nov 2, 2017Updated 8 years ago
- 作業系統實作☆13Apr 26, 2018Updated 7 years ago
- Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras☆160Dec 26, 2019Updated 6 years ago
- Machine Learning solution for Kaggle.com's "Partly Sunny with a Chance of Hashtags"☆27Dec 6, 2013Updated 12 years ago
- Code for "End-to-End Learning of Flowchart Grounded Task-Oriented Dialogs"☆14Oct 10, 2022Updated 3 years ago
- Generalized Optimal Transport Attention with Trainable Priors☆26Jan 25, 2026Updated 2 months ago
- ☆25Jan 20, 2022Updated 4 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Analogs of Linguistic Structure in Deep Representations☆19Jul 27, 2017Updated 8 years ago
- Matlab code for the area under the receiver operating curve (AUC) and confidence intervals☆16Nov 10, 2014Updated 11 years ago
- Port of tasbot to linux☆10Apr 25, 2013Updated 12 years ago
- This repo contains a set of notebooks to reproduce reinforcement learning algorithms.☆16Nov 21, 2022Updated 3 years ago
- The labs of ARC university courses☆12Aug 29, 2023Updated 2 years ago
- Keras implementation of the Information Dropout (arXiv:1611.01353) paper☆15Dec 31, 2016Updated 9 years ago
- Translate - a PyTorch Language Library☆10Mar 14, 2019Updated 7 years ago
- ☆38Mar 6, 2017Updated 9 years ago
- A Deep Generative Distance-Based Classifier for Out-of-Domain Detection with Mahalanobis Space☆12Jun 21, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ROS package for robot learning☆17Oct 16, 2019Updated 6 years ago
- ROS package suite for robots at Hakuto, a Google XPRIZE contender☆12Apr 27, 2016Updated 9 years ago
- Caffe code and prototxt files for the CNN Design Patterns paper☆56Nov 8, 2016Updated 9 years ago
- ☆13Jul 25, 2019Updated 6 years ago
- Constrained episodic reinforcement learning in concave-convex and knapsack settings☆11Oct 3, 2023Updated 2 years ago
- Python implementation of the NEAT neuroevolution algorithm☆12Dec 13, 2019Updated 6 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆54May 15, 2019Updated 6 years ago