lmzintgraf / gp_pref_elicit
Ordered Preference Elicitation Strategies for Multi-Objective Decision Making using Gaussian Processes
☆23Updated 6 years ago
Alternatives and similar repositories for gp_pref_elicit:
Users that are interested in gp_pref_elicit are comparing it to the libraries listed below
- Scalable Bayesian Inverse Reinforcement Learning (ICLR 2021) by Alex J. Chan and Mihaela van der Schaar.☆45Updated 4 years ago
- Deep Gaussian Process for Inverse Reinforcement Learning☆33Updated 7 years ago
- Companion code for RSS 2020 paper: "Active Preference-Based Gaussian Process Regression for Reward Learning"☆40Updated last year
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 4 years ago
- Code associated with paper "High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization"☆15Updated 4 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆56Updated 5 years ago
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Updated 5 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆28Updated 5 years ago
- NeurIPS 2019 Paper☆11Updated 5 years ago
- PyTorch implementation of Probabilistic Network Ensembles on toy problems☆23Updated 2 years ago
- Deconfounding Reinforcement Learning in Observational Settings☆51Updated 6 years ago
- Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning☆32Updated 7 years ago
- Code to run the ASEBO algorithm from the paper: From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization... …☆16Updated 4 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning☆47Updated 6 years ago
- Implementation of (Learning Continuous Control Policies by Stochastic Value Gradients)[https://arxiv.org/abs/1510.09142]☆26Updated 3 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 5 years ago
- Max-value Entropy Search for Efficient Bayesian Optimization☆73Updated 3 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆33Updated 5 years ago
- Safe Policy Improvement with Baseline Bootstrapping☆26Updated 4 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆49Updated 3 years ago
- ☆54Updated 7 years ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Updated 2 years ago
- Python package for Preference Learning with Gaussian Processes.☆33Updated 3 years ago
- ☆27Updated 5 years ago
- ☆43Updated 3 years ago
- ☆49Updated 3 years ago
- Bayes-Adaptive Monte-Carlo Planning algorithm☆16Updated 12 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Updated last year
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 7 years ago
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"☆34Updated 5 years ago