Robust policy search algorithms which train on model ensembles
☆31Oct 26, 2016Updated 9 years ago
Alternatives and similar repositories for robustRL
Users that are interested in robustRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code to train RL agents along with Adversarial distrubance agents☆66Mar 21, 2017Updated 9 years ago
- Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)☆20Nov 26, 2020Updated 5 years ago
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…☆12Mar 17, 2021Updated 5 years ago
- ☆15Sep 25, 2019Updated 6 years ago
- Collaborative Deep Reinforcement Learning☆32Jul 29, 2017Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of the Unsupervised learning by predicting noise paper☆14Jan 18, 2018Updated 8 years ago
- Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918…☆48Apr 14, 2019Updated 7 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Jul 12, 2017Updated 8 years ago
- ☆15Oct 20, 2020Updated 5 years ago
- Code for "Detecting Adversarial Attacks on Neural Network Policies with Visual Foresight"☆79Oct 18, 2017Updated 8 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆95Apr 7, 2018Updated 8 years ago
- Train an RL agent to play multiple Atari games at once☆69Jun 6, 2016Updated 9 years ago
- ☆15Feb 15, 2017Updated 9 years ago
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆41Jan 27, 2018Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Sep 13, 2019Updated 6 years ago
- Representation Learning in RL☆13Jun 1, 2022Updated 3 years ago
- ☆101Aug 15, 2016Updated 9 years ago
- Code for the paper Normalizing Flows are Capable Models for RL☆19Jun 3, 2025Updated 10 months ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Jun 24, 2020Updated 5 years ago
- Implementation of safe offline bandit algorithms.☆10Oct 27, 2019Updated 6 years ago
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u…☆53Feb 16, 2020Updated 6 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆29Mar 27, 2021Updated 5 years ago
- Meta-Reinforcement Learning with Policy Residual Representation☆11Aug 15, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Projected Overrelaxed Jacobi (JORProx) and Gauss-Seidel (SORProx) GPU implementations.☆13Jan 14, 2019Updated 7 years ago
- Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)☆35Mar 6, 2021Updated 5 years ago
- A Python library for reinforcement learning using Bayesian approaches☆53May 14, 2015Updated 10 years ago
- ☆69May 26, 2018Updated 7 years ago
- Reference implementation for Structured Prediction with Deep Value Networks☆54Jul 10, 2017Updated 8 years ago
- My Body Is A Cage☆41Apr 13, 2021Updated 5 years ago
- Efficient Exploration via State Marginal Matching (2019)☆69Jun 30, 2019Updated 6 years ago
- ☆18Jul 13, 2022Updated 3 years ago
- Model-Free Episodic Control☆14Jan 12, 2017Updated 9 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official implementation for the paper "Offline Meta RL - Identifiability Challenges and Effective Data Collection Strategies", NeurIPS 20…☆31Nov 23, 2021Updated 4 years ago
- This is the repository for the Master of Science thesis titled "GAN-based Matrix Factorization for Recommender Systems".☆10Aug 10, 2020Updated 5 years ago
- ☆27Dec 2, 2017Updated 8 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Nov 14, 2019Updated 6 years ago
- ☆120Jul 9, 2020Updated 5 years ago
- ☆11Feb 11, 2024Updated 2 years ago
- Some code for tutorials following https://gym.openai.com/docs/rl☆14Jul 3, 2016Updated 9 years ago