Proximal Policy Optimization with Stein Control Variates:
☆33Feb 12, 2018Updated 8 years ago
Alternatives and similar repositories for PPO-Stein-Control-Variate
Users that are interested in PPO-Stein-Control-Variate are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Stein Variational Policy Gradient for REINFORCE☆18Jul 12, 2017Updated 8 years ago
- ☆160Jul 21, 2017Updated 8 years ago
- Code release for the ICLR paper☆21Jun 13, 2018Updated 7 years ago
- Tutorial on continuous control at Reinforcement Learning Summer School 2017.☆34Jul 3, 2017Updated 8 years ago
- ☆10Apr 2, 2018Updated 7 years ago
- PyTorch implementation of Sample Efficient Actor-Critic with Experience Replay(ACER)☆16Oct 7, 2020Updated 5 years ago
- This repository contains implementations of the paper, Bayesian Model-Agnostic Meta-Learning.☆20Jan 19, 2023Updated 3 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning☆21Mar 6, 2017Updated 9 years ago
- Noisy Networks for Exploration☆187Jan 28, 2018Updated 8 years ago
- Model-based reinforcement learning (generative simulator models and planning agents)☆16Mar 13, 2026Updated last week
- Experiments of amortized stein variational gradient☆16Apr 30, 2017Updated 8 years ago
- A pytorch implementation of Amortized Stein Variational Gradient Descent/ Stein GAN☆18Dec 13, 2018Updated 7 years ago
- TD-VAE in PyTorch☆10May 28, 2019Updated 6 years ago
- ICML 2018 Self-Imitation Learning☆277Apr 18, 2020Updated 5 years ago
- ☆86Apr 10, 2021Updated 4 years ago
- Public accompanying repository for Universite de Montreal's IFT 6757: Autnonomous Vehicles, Fall 2019.☆12Jun 21, 2022Updated 3 years ago
- Code for the paper "Evolved Policy Gradients"☆254Nov 22, 2018Updated 7 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆436Nov 28, 2023Updated 2 years ago
- code for the paper "Stein Variational Gradient Descent (SVGD): A General Purpose Bayesian Inference Algorithm"☆420Mar 21, 2024Updated 2 years ago
- A tensorflow implementation of VAE training with Renyi divergence☆31Sep 14, 2016Updated 9 years ago
- ☆13May 15, 2025Updated 10 months ago
- PyTorch implementation of Stein Variational Gradient Descent☆48Jun 16, 2023Updated 2 years ago
- Implementation of Stein Variational Gradient Descent with TensorFlow 2.0☆12Sep 11, 2019Updated 6 years ago
- on-policy optimization baselines for deep reinforcement learning☆32Apr 3, 2020Updated 5 years ago
- This repository contains implementations of the paper, Bayesian Model-Agnostic Meta-Learning.☆61Jul 12, 2019Updated 6 years ago
- Reinforcement learning benchmarking.☆39Oct 22, 2018Updated 7 years ago
- a library for deep reinforcement learning, with applications for navigation☆16Feb 6, 2018Updated 8 years ago
- SparseMax activation function implementation (ICML 2016) (PyTorch)☆28Nov 30, 2017Updated 8 years ago
- Tensorflow implementation of proximal policy optimization (PPO) algorithm☆13Feb 28, 2018Updated 8 years ago
- Research project - real-time multi-agent pursuit a moving target☆17Mar 13, 2021Updated 5 years ago
- Baselines and memory-based scenarios for the ViZDoom simulator☆36Dec 8, 2022Updated 3 years ago
- Implementation of proximal policy optimization(PPO) with tensorflow☆35Feb 10, 2018Updated 8 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆155Sep 22, 2017Updated 8 years ago
- Models built with TensorFlow☆26Dec 5, 2018Updated 7 years ago
- My homework solutions for UC Berkeley CS294: deep unsupervised learning☆14Mar 24, 2023Updated 2 years ago
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆14Jul 16, 2024Updated last year
- Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems☆32Oct 9, 2018Updated 7 years ago
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"☆46Nov 22, 2022Updated 3 years ago
- Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"☆310Apr 13, 2023Updated 2 years ago