Wanqianxn / usfa
Implementation of USFAs: https://arxiv.org/pdf/1812.07626.pdf
☆9 · Updated 6 years ago
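The repository implements Universal Successor Features Approximators (USFAs) from the linked paper. As a toy illustration of the core idea, not the repository's actual API: the action value factorises as Q(s, a; w, z) = ψ(s, a; z)·w, and actions are chosen by generalised policy improvement (GPI) over a set of candidate policy vectors z. All shapes, names, and the random stand-in for ψ below are assumptions for the sketch.

```python
import numpy as np

rng = np.random.default_rng(0)
n_actions, d = 4, 8                      # action count and feature dim (assumed)
z_candidates = rng.normal(size=(5, d))   # candidate policy vectors z

def psi(state, z):
    """Stand-in universal successor features psi(s, ., z): one d-vector per action."""
    return np.tanh(np.outer(np.arange(1, n_actions + 1), z) + state)

def gpi_action(state, w):
    """GPI rule from the USFA paper: a* = argmax_a max_z psi(s, a; z) . w."""
    q = np.stack([psi(state, z) @ w for z in z_candidates])  # (n_z, n_actions)
    return int(q.max(axis=0).argmax())

w_task = rng.normal(size=d)  # task description / reward-weight vector
a = gpi_action(0.5, w_task)  # greedy GPI action for this state and task
```

Swapping in a different task vector `w` without retraining is the transfer setting USFAs target.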
Alternatives and similar repositories for usfa:
Users interested in usfa are comparing it to the libraries listed below.
- ☆31 · Updated 5 years ago
- Project on Successor Features in Deep Reinforcement Learning and Transfer Learning ☆24 · Updated 7 years ago
- ☆53 · Updated last year
- Simple maze environments using mujoco-py ☆54 · Updated last year
- ☆28 · Updated 3 years ago
- Implementation of the Option-Critic Architecture ☆39 · Updated 6 years ago
- Learning Laplacian Representations in Reinforcement Learning ☆17 · Updated 4 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning ☆28 · Updated 3 years ago
- Random-parameter environments using gym 0.7.4 and mujoco-py 0.5.7 ☆20 · Updated 6 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination ☆28 · Updated 2 years ago
- ☆29 · Updated 2 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration ☆57 · Updated 3 years ago
- Proximal Policy Option-Critic ☆22 · Updated 6 years ago
- ☆26 · Updated 2 years ago
- PyTorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019) ☆18 · Updated 2 years ago
- Implementation of "Machine Theory of Mind" (ICML 2018) ☆24 · Updated 3 years ago
- Code for the FOCAL paper published at ICLR 2021 ☆52 · Updated last year
- ☆55 · Updated 2 years ago
- ☆42 · Updated 3 years ago
- OPE tools based on the "Empirical Study of Off-Policy Policy Estimation" paper ☆61 · Updated 2 years ago
- A reusable framework for successor features for transfer in deep reinforcement learning, using Keras ☆43 · Updated 3 years ago
- Learning bisimulation metrics for control, particularly suited to sparse-reward settings ☆10 · Updated 2 years ago
- ☆60 · Updated 6 years ago
- Code accompanying the NeurIPS 2019 paper "Distributional Policy Optimization: An Alternative Approach for Continuous Control" ☆22 · Updated 5 years ago
- ☆53 · Updated 4 years ago
- COOM: Benchmarking Continual Reinforcement Learning on Doom ☆17 · Updated last month
- Offline Risk-Averse Actor-Critic (O-RAAC): a model-free RL algorithm for risk-averse RL in a fully offline setting ☆35 · Updated 4 years ago
- ☆31 · Updated 4 years ago
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture ☆20 · Updated 6 years ago
- Inverse Reinforcement Learning via State Marginal Matching (CoRL 2020) ☆45 · Updated last year