Proximal Policy Optimization with Stein Control Variates:
☆33Feb 12, 2018Updated 8 years ago
Alternatives and similar repositories for PPO-Stein-Control-Variate
Users that are interested in PPO-Stein-Control-Variate are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Stein Variational Policy Gradient for REINFORCE☆18Jul 12, 2017Updated 8 years ago
- ☆160Jul 21, 2017Updated 8 years ago
- Code release for the ICLR paper☆21Jun 13, 2018Updated 7 years ago
- Tutorial on continuous control at Reinforcement Learning Summer School 2017.☆34Jul 3, 2017Updated 8 years ago
- ☆10Apr 2, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repository contains implementations of the paper, Bayesian Model-Agnostic Meta-Learning.☆20Jan 19, 2023Updated 3 years ago
- PyTorch implementation of Sample Efficient Actor-Critic with Experience Replay(ACER)☆16Oct 7, 2020Updated 5 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning☆21Mar 6, 2017Updated 9 years ago
- Model-based reinforcement learning (generative simulator models and planning agents)☆16Mar 13, 2026Updated last month
- ☆29Nov 21, 2022Updated 3 years ago
- Experiments of amortized stein variational gradient☆16Apr 30, 2017Updated 8 years ago
- A pytorch implementation of Amortized Stein Variational Gradient Descent/ Stein GAN☆18Dec 13, 2018Updated 7 years ago
- TD-VAE in PyTorch☆10May 28, 2019Updated 6 years ago
- ICML 2018 Self-Imitation Learning☆275Apr 18, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆86Apr 10, 2021Updated 5 years ago
- Public accompanying repository for Universite de Montreal's IFT 6757: Autnonomous Vehicles, Fall 2019.☆12Jun 21, 2022Updated 3 years ago
- Code for the paper "Evolved Policy Gradients"☆254Nov 22, 2018Updated 7 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆437Nov 28, 2023Updated 2 years ago
- ☆13May 15, 2025Updated 11 months ago
- PyTorch implementation of Stein Variational Gradient Descent☆48Jun 16, 2023Updated 2 years ago
- Implementation of Stein Variational Gradient Descent with TensorFlow 2.0☆12Sep 11, 2019Updated 6 years ago
- on-policy optimization baselines for deep reinforcement learning☆32Apr 3, 2020Updated 6 years ago
- This repository contains implementations of the paper, Bayesian Model-Agnostic Meta-Learning.☆60Jul 12, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Reinforcement learning benchmarking.☆39Oct 22, 2018Updated 7 years ago
- a library for deep reinforcement learning, with applications for navigation☆16Feb 6, 2018Updated 8 years ago
- Tensorflow implementation of proximal policy optimization (PPO) algorithm☆13Feb 28, 2018Updated 8 years ago
- Research project - real-time multi-agent pursuit a moving target☆17Mar 13, 2021Updated 5 years ago
- Baselines and memory-based scenarios for the ViZDoom simulator☆36Dec 8, 2022Updated 3 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆155Sep 22, 2017Updated 8 years ago
- My homework solutions for UC Berkeley CS294: deep unsupervised learning☆14Mar 24, 2023Updated 3 years ago
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆14Jul 16, 2024Updated last year
- Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems☆32Oct 9, 2018Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"☆46Nov 22, 2022Updated 3 years ago
- Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"☆310Apr 13, 2023Updated 3 years ago
- Normalizing Flows in Jax☆109Aug 19, 2020Updated 5 years ago
- Proximal Policy Optimization with TensorFlow and OpenAI Gym☆18Mar 31, 2018Updated 8 years ago
- self implementation of DPPO, Distributed Proximal Policy Optimization, by using tensorflow☆12Sep 1, 2017Updated 8 years ago
- An experiment with Thompson sampling and TD(0) on a grid world variant☆17Nov 8, 2013Updated 12 years ago
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning☆51Jun 7, 2021Updated 4 years ago