lmzintgraf / gp_pref_elicitLinks

Ordered Preference Elicitation Strategies for Multi-Objective Decision Making using Gaussian Processes

☆23

Alternatives and similar repositories for gp_pref_elicit

Users that are interested in gp_pref_elicit are comparing it to the libraries listed below

Sorting:

jparkerholder / ASEBO
Code to run the ASEBO algorithm from the paper: From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization... …
☆16Updated 4 years ago
XanderJC / scalable-birl
Scalable Bayesian Inverse Reinforcement Learning (ICLR 2021) by Alex J. Chan and Mihaela van der Schaar.
☆47Updated 4 years ago
dtak / hip-mdp-public
Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning
☆32Updated 7 years ago
HumanCompatibleAI / population-irl
(Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards
☆28Updated 6 years ago
facebookresearch / ContextualBO
Code associated with paper "High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization"
☆15Updated 4 years ago
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24Updated 5 years ago
acguez / bamcp
Bayes-Adaptive Monte-Carlo Planning algorithm
☆17Updated 12 years ago
ermongroup / CalibratedModelBasedRL
Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.
☆55Updated 6 years ago
jinming99 / DGP-IRL
Deep Gaussian Process for Inverse Reinforcement Learning
☆33Updated 8 years ago
Santara / stochastic_value_gradient
Implementation of (Learning Continuous Control Policies by Stochastic Value Gradients)[https://arxiv.org/abs/1510.09142]
☆25Updated 3 years ago
sisl / MPHRL
Model Primitive Hierarchical Reinforcement Learning
☆13Updated 2 years ago
jonasrothfuss / meta_learning_pacoh
Meta-learning Gaussian process (GP) priors via PAC-Bayes bounds
☆26Updated last year
llan-ml / tesp
Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"
☆34Updated 6 years ago
thanhnguyentang / offline_neural_bandits
An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR…
☆13Updated 3 years ago
CausalRL / DRL
Deconfounding Reinforcement Learning in Observational Settings
☆52Updated 6 years ago
automl / LTO-CMA
Code for the paper "Learning Step-Size Adaptation in CMA-ES"
☆11Updated 2 years ago
wyndwarrior / Sectar
Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings
☆96Updated 7 years ago
rail-berkeley / design-bench
☆50Updated 3 years ago
YRussac / WeightedLinearBandits
Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"
☆17Updated 5 years ago
befelix / SafeOpt
Safe Bayesian Optimization
☆148Updated 2 years ago
HoangATran / AdaDGS
An adaptive black-box optimization method with directional Gaussian smoothing for high-dimensional multi-modal functions
☆9Updated 4 years ago
johannesnauta / pytorch-pne
PyTorch implementation of Probabilistic Network Ensembles on toy problems
☆23Updated 2 years ago
hari-sikchi / safeRL
Safe Reinforcement Learning algorithms
☆74Updated 2 years ago
kazizzad / BDQN-MxNet-Gluon
Efficient Exploration through Bayesian Deep Q-Networks
☆37Updated 7 years ago
zuoxingdong / DeepPILCO
☆54Updated 7 years ago
krasheninnikov / max-causal-ent-irl
Maximum Causal Entropy Inverse Reinforcement Learning
☆48Updated 6 years ago
zafarali / emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
☆49Updated 3 years ago
wyjung0625 / p3s
Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning
☆22Updated 5 years ago
robintyh1 / neurips2021-meta-gradient-offpolicy-evaluation
Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021
☆12Updated 3 years ago
jonasrothfuss / model_ensemble_meta_learning
Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm
☆44Updated 6 years ago