mwufi / meta-rl-banditsLinks

A simple RNN meta-learner

☆10

Alternatives and similar repositories for meta-rl-bandits

Users that are interested in meta-rl-bandits are comparing it to the libraries listed below

Sorting:

watakandai / hiro_pytorch
Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)
☆110Updated 4 years ago
Sonkyunghwan / QTRAN
There will be updates later
☆84Updated 6 years ago
IouJenLiu / CMAE
☆49Updated 3 years ago
chanb / metalearning_RL
☆20Updated 2 years ago
AnujMahajanOxf / MAVEN
Submission for MAVEN: Multi-Agent Variational Exploration
☆59Updated 3 years ago
watchernyu / REDQ
Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.
☆170Updated 8 months ago
quantumiracle / Benchmark-Efficient-Reinforcement-Learning-with-Demonstrations
Benchmark present methods for efficient reinforcement learning. Methods include Reptile, MAML, Residual Policy, etc. RL algorithms includ…
☆31Updated 2 years ago
lweitkamp / feudalnets-pytorch
PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.
☆41Updated 5 years ago
alirezakazemipour / DIAYN-PyTorch
Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.
☆69Updated last year
kpaonaut / HAAR-A-Hierarchical-RL-Algorithm
Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards
☆31Updated 2 years ago
011235813 / hierarchical-marl
Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery
☆103Updated 3 years ago
shariqiqbal2810 / REFIL
Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021
☆65Updated 4 years ago
feidieufo / homework
Assignments for CS294-112.
☆30Updated 5 years ago
TonghanWang / EITI-EDTI
Codes accompanying the paper "Influence-Based Multi-Agent Exploration" (ICLR 2020 spotlight)
☆33Updated 5 years ago
lweitkamp / option-critic-pytorch
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
☆130Updated 11 months ago
YangRui2015 / Sparse-Reward-Algorithms
Implement many Sparse Reward algorithms in Gym Fetch environment
☆88Updated 5 years ago
Xingyu-Lin / mbpo_pytorch
A pytorch reprelication of the model-based reinforcement learning algorithm MBPO
☆170Updated 3 years ago
lucaslingle / pytorch_rl2
Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'
☆64Updated 3 years ago
BY571 / CQL
PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…
☆137Updated last year
andrew-j-levy / Hierarchical-Actor-Critc-HAC-
This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.
☆259Updated 5 years ago
jxu43 / replication-mbpo
NeurIPS Reproducibility Challenge 2019
☆20Updated 5 years ago
apexrl / bmpo
Implementation of ICML2020 paper <Bidirectional Model-based Policy Optimization>
☆23Updated 2 years ago
ovechkin-dm / ppo-lstm-parallel
ppo-lstm-parallel
☆45Updated 6 years ago
kikojay / EMC
The code for paper, "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration", NeurIPS 2021.
☆40Updated 2 years ago
TonghanWang / NDQ
Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)
☆81Updated 2 years ago
tianheyu927 / mopo
Code for MOPO: Model-based Offline Policy Optimization
☆179Updated 3 years ago
shlee94 / Off2OnRL
☆56Updated 2 years ago
yalidu / liir
Learning Individual Intrinsic Reward in MARL
☆63Updated 2 years ago
cyoon1729 / Policy-Gradient-Methods
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
☆99Updated 5 years ago
mzho7212 / LICA
[NeurIPS 2020] PyTorch implementation of "Learning Implicit Credit Assignment for Cooperative Muti-Agent Reinforcement Learning"
☆60Updated last year