kantneel / causal-metarl

WIP implementation of https://arxiv.org/pdf/1901.08162.pdf

☆9

Alternatives and similar repositories for causal-metarl

Users who are interested in causal-metarl are comparing it to the repositories listed below.

jparkerholder / PB2
Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.
☆20 Updated 4 years ago
TomZahavy / CB_AE_DQN
Contextual Bandits Action Elimination DQN
☆21 Updated 7 years ago
yudasong / briee
Representation Learning in RL
☆14 Updated 3 years ago
LinZichuan / AdMRL
Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)
☆35 Updated 4 years ago
illidanlab / rpg
Ranking Policy Gradient
☆23 Updated 5 years ago
Kaixhin / GUDRL
Generalised UDRL
☆37 Updated 3 years ago
YyzHarry / SV-RL
[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning
☆34 Updated 5 years ago
ml-jku / align-rudder
Code to reproduce results on toy tasks and companion blog for the paper.
☆20 Updated 3 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32 Updated 5 years ago
ha0ransun / Path-Auxiliary-Sampler
☆11 Updated 2 years ago
zt95 / infinite-horizon-off-policy-estimation
☆13 Updated 6 years ago
google-research / deep_ope
☆86 Updated 11 months ago
haoliuhl / taming-maml
Taming MAML: efficient unbiased meta-reinforcement learning
☆29 Updated 2 years ago
epignatelli / discovering-reinforcement-learning-algorithms
A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…
☆21 Updated 4 years ago
tuomaso / radial_rl
Code used in our paper "Robust Deep Reinforcement Learning through Adversarial Loss"
☆33 Updated last year
LiuShuai26 / Distributed-RL
Tutorial on distributed DRL with Ray and TensorFlow.
☆10 Updated 5 years ago
KyunghyunLee / aes-rl
☆17 Updated 4 years ago
clvoloshin / constrained_batch_policy_learning
☆27 Updated 5 years ago
llan-ml / tesp
Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"
☆34 Updated 6 years ago
tianjunz / MADE
☆19 Updated 3 years ago
yiqiwang8177 / Official-codebase-for-Decision-Transducer
This is the PyTorch implementation of the UAI 2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…
☆11 Updated last year
Xingyu-Lin / auxiliary-tasks-rl
Code for the paper Adaptive Auxiliary Task Weighting for Reinforcement Learning
☆26 Updated 5 years ago
sfujim / SR-DICE
Author's PyTorch implementation of SR-DICE for marginalized importance sampling
☆17 Updated 3 years ago
cle-ros / RoutingNetworks
☆66 Updated 4 years ago
FerranAlet / modular-metalearning
☆78 Updated 4 years ago
CausalRL / DRL
Deconfounding Reinforcement Learning in Observational Settings
☆52 Updated 6 years ago
ARM-gradient / ARSM
Low-variance and unbiased gradient for backpropagation through categorical random variables, with application in variational auto-encoder…
☆17 Updated 5 years ago
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24 Updated 5 years ago
kaixin96 / mixreg
Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization
☆33 Updated 4 years ago
chandar-lab / Lifelong-Hanabi
A Continual Multi-agent RL testbed based on Hanabi
☆30 Updated 3 years ago