lmzintgraf / gp_pref_elicit
Ordered Preference Elicitation Strategies for Multi-Objective Decision Making using Gaussian Processes
☆23Updated 6 years ago
Alternatives and similar repositories for gp_pref_elicit
Users who are interested in gp_pref_elicit are comparing it to the libraries listed below.
- Code to run the ASEBO algorithm from the paper: From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization…☆16Updated 4 years ago
- Code associated with paper "High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization"☆15Updated 4 years ago
- Code for the paper "Learning Step-Size Adaptation in CMA-ES"☆11Updated 2 years ago
- NeurIPS 2019 Paper☆11Updated 5 years ago
- Scalable Bayesian Inverse Reinforcement Learning (ICLR 2021) by Alex J. Chan and Mihaela van der Schaar.☆47Updated 4 years ago
- PyTorch implementation of Probabilistic Network Ensembles on toy problems☆23Updated 2 years ago
- Implementation of [Learning Continuous Control Policies by Stochastic Value Gradients](https://arxiv.org/abs/1510.09142)☆26Updated 3 years ago
- PyTorch implementation of various reinforcement learning algorithms☆18Updated 7 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 5 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 7 years ago
- ☆35Updated 6 years ago
- Paper: Challenges in High-dimensional Reinforcement Learning with Evolution Strategies☆28Updated 3 years ago
- Python package for Preference Learning with Gaussian Processes.☆33Updated 3 years ago
- Re-Examining Linear Embeddings for High-dimensional Bayesian Optimization☆41Updated 3 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆56Updated 6 years ago
- Deconfounding Reinforcement Learning in Observational Settings☆52Updated 6 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Entropy Search for Information-Efficient Global Optimization - JMLR v13☆29Updated 8 years ago
- Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning☆32Updated 7 years ago
- Implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies☆11Updated 4 years ago
- Multi-fidelity Gaussian Process Bandit Optimisation☆39Updated 8 years ago
- Code repository for Ensemble Bayesian Optimization☆53Updated 5 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 5 years ago
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"☆34Updated 6 years ago
- ☆50Updated 3 years ago
- Companion code for RSS 2020 paper: "Active Preference-Based Gaussian Process Regression for Reward Learning"☆39Updated last year
- An adaptive black-box optimization method with directional Gaussian smoothing for high-dimensional multi-modal functions☆9Updated 4 years ago
- Code accompanying the paper "Information Directed Reward Learning for Reinforcement Learning" (NeurIPS 2021).☆13Updated 3 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Updated 6 years ago
- Source code for "Multi-objective Model-based Policy Search for Data-efficient Learning with Sparse Rewards" (CoRL 2018)☆13Updated 6 years ago