lmzintgraf / gp_pref_elicit
Ordered Preference Elicitation Strategies for Multi-Objective Decision Making using Gaussian Processes
☆23 Updated 7 years ago
Alternatives and similar repositories for gp_pref_elicit
Users who are interested in gp_pref_elicit are comparing it to the libraries listed below.
- Code to run the ASEBO algorithm from the paper: From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization… ☆16 Updated 4 years ago
- Code associated with paper "High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization" ☆15 Updated 4 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019. ☆55 Updated 6 years ago
- Scalable Bayesian Inverse Reinforcement Learning (ICLR 2021) by Alex J. Chan and Mihaela van der Schaar. ☆48 Updated 4 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards ☆27 Updated 6 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization ☆24 Updated 5 years ago
- Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning ☆32 Updated 7 years ago
- PyTorch implementation of Probabilistic Network Ensembles on toy problems ☆23 Updated 2 years ago
- Companion code for RSS 2020 paper: "Active Preference-Based Gaussian Process Regression for Reward Learning" ☆39 Updated last year
- Model Primitive Hierarchical Reinforcement Learning ☆13 Updated 2 years ago
- Code for the paper "Learning Step-Size Adaptation in CMA-ES" ☆12 Updated 2 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019) ☆32 Updated 5 years ago
- Meta-learning Gaussian process (GP) priors via PAC-Bayes bounds ☆26 Updated last year
- Implementation of "Learning Continuous Control Policies by Stochastic Value Gradients" (https://arxiv.org/abs/1510.09142) ☆25 Updated 3 years ago
- NeurIPS 2019 Paper ☆11 Updated 5 years ago
- Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings ☆97 Updated 7 years ago
- Implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies ☆11 Updated 4 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning ☆48 Updated 6 years ago
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy" ☆34 Updated 6 years ago
- ☆51 Updated 3 years ago
- An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR… ☆13 Updated 3 years ago
- ☆54 Updated 7 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm ☆44 Updated 6 years ago
- Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning ☆22 Updated 5 years ago
- Safe Policy Improvement with Baseline Bootstrapping ☆26 Updated 5 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations ☆49 Updated 3 years ago
- Code for abstracting, evaluating, and visualizing Markov Decision Processes. ☆10 Updated 8 years ago
- ☆31 Updated last year
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees ☆55 Updated 6 years ago
- Efficient Exploration through Bayesian Deep Q-Networks ☆37 Updated 7 years ago