lmzintgraf / gp_pref_elicitLinks
Ordered Preference Elicitation Strategies for Multi-Objective Decision Making using Gaussian Processes
☆23Updated 7 years ago
Alternatives and similar repositories for gp_pref_elicit
Users that are interested in gp_pref_elicit are comparing it to the libraries listed below
Sorting:
- Code to run the ASEBO algorithm from the paper: From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization... …☆16Updated 4 years ago
- Scalable Bayesian Inverse Reinforcement Learning (ICLR 2021) by Alex J. Chan and Mihaela van der Schaar.☆47Updated 4 years ago
- Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning☆32Updated 7 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆28Updated 6 years ago
- Code associated with paper "High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization"☆15Updated 4 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 5 years ago
- Bayes-Adaptive Monte-Carlo Planning algorithm☆17Updated 12 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆55Updated 6 years ago
- Deep Gaussian Process for Inverse Reinforcement Learning☆33Updated 8 years ago
- Implementation of (Learning Continuous Control Policies by Stochastic Value Gradients)[https://arxiv.org/abs/1510.09142]☆25Updated 3 years ago
- Model Primitive Hierarchical Reinforcement Learning☆13Updated 2 years ago
- Meta-learning Gaussian process (GP) priors via PAC-Bayes bounds☆26Updated last year
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"☆34Updated 6 years ago
- An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR…☆13Updated 3 years ago
- Deconfounding Reinforcement Learning in Observational Settings☆52Updated 6 years ago
- Code for the paper "Learning Step-Size Adaptation in CMA-ES"☆11Updated 2 years ago
- Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings☆96Updated 7 years ago
- ☆50Updated 3 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 5 years ago
- Safe Bayesian Optimization☆148Updated 2 years ago
- An adaptive black-box optimization method with directional Gaussian smoothing for high-dimensional multi-modal functions☆9Updated 4 years ago
- PyTorch implementation of Probabilistic Network Ensembles on toy problems☆23Updated 2 years ago
- Safe Reinforcement Learning algorithms☆74Updated 2 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 7 years ago
- ☆54Updated 7 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning☆48Updated 6 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆49Updated 3 years ago
- Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning☆22Updated 5 years ago
- Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021☆12Updated 3 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 6 years ago