HumanCompatibleAI / learning_biases

Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.

☆25

Alternatives and similar repositories for learning_biases

Users who are interested in learning_biases are comparing it to the libraries listed below.

jannerm / gamma-models
Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"
☆44 Updated last year
roosephu / slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆55 Updated 6 years ago
nnaisense / MAX
Code for reproducing experiments in Model-Based Active Exploration, ICML 2019
☆79 Updated 6 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32 Updated 5 years ago
qxcv / magical
The MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020)
☆77 Updated last year
deep-skill-chaining / deep-skill-chaining
Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"
☆29 Updated 5 years ago
orybkin / video-gcp
Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"
☆45 Updated 2 years ago
RLAgent / state-marginal-matching
Efficient Exploration via State Marginal Matching (2019)
☆69 Updated 6 years ago
KyriacosShiarli / taco
☆25 Updated 6 years ago
dsbrown1331 / bayesianrex
☆20 Updated 4 years ago
facebookresearch / icp-block-mdp
Invariant Causal Prediction for Block MDPs
☆44 Updated 5 years ago
Stanford-ILIAD / batch-active-preference-based-learning
Companion code to CoRL 2018 paper: E Bıyık, D Sadigh. "Batch Active Preference-Based Learning of Reward Functions". Conference on Robot L…
☆29 Updated 6 years ago
ruizhaogit / mep
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)
☆24 Updated 6 years ago
facebookresearch / CausalSkillLearning
Codebase for project about unsupervised skill learning via variational inference and causality.
☆43 Updated last year
johanobandoc / revisiting_rainbow
Revisiting Rainbow
☆75 Updated 4 years ago
facebookresearch / adversarially-motivated-intrinsic-goals
This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".
☆63 Updated last year
Stanford-ILIAD / ELLA
Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.
☆21 Updated 4 years ago
tgangwani / BMIL
Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)
☆21 Updated 3 years ago
ElisevanderPol / mdp-homomorphic-networks
☆29 Updated 4 years ago
arushijain94 / SafeOptionCritic
Safe Option-Critic: Learning Safety in the Option-Critic Architecture
☆20 Updated 6 years ago
denisyarats / proto
Proto-RL: Reinforcement Learning with Prototypical Representations
☆82 Updated 3 years ago
ermongroup / CalibratedModelBasedRL
Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.
☆55 Updated 6 years ago
tonyzhaozh / meld
MELD: Meta-Reinforcement Learning from Images via Latent State Models https://arxiv.org/abs/2010.13957
☆63 Updated 4 years ago
xkianteb / dril
Disagreement-Regularized Imitation Learning
☆30 Updated 4 years ago
hari-sikchi / LOOP
Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]
☆40 Updated 2 years ago
zafarali / emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
☆49 Updated 3 years ago
joeybose / FloRL
Implicit Normalizing Flows + Reinforcement Learning
☆61 Updated 6 years ago
google-research / clevr_robot_env
CLEVR-Robot: a reinforcement learning environment combining vision, language and control.
☆135 Updated last year
montrealrobotics / active-domainrand
Code repository for Active Domain Randomization (CoRL 2019, https://arxiv.org/abs/1904.04762)
☆98 Updated 4 years ago
jonasrothfuss / model_ensemble_meta_learning
Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm
☆44 Updated 6 years ago