dragen1860 / MAML-Pytorch-RLLinks

☆31

Alternatives and similar repositories for MAML-Pytorch-RL

Users that are interested in MAML-Pytorch-RL are comparing it to the libraries listed below

Sorting:

YangRui2015 / Sparse-Reward-Algorithms
Implement many Sparse Reward algorithms in Gym Fetch environment
☆88Updated 5 years ago
RchalYang / Soft-Module
Code for "Multi-task Reinforcement Learning with Soft Modularization"
☆122Updated 4 years ago
TonghanWang / NDQ
Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)
☆81Updated 2 years ago
amazon-science / meta-q-learning
Code for the paper "Meta-Q-Learning"( ICLR 2020)
☆103Updated 3 years ago
TianhongDai / distributed-ppo
This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).
☆62Updated 6 years ago
Theohhhu / UPDeT
Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotli…
☆133Updated 4 years ago
twni2016 / Meta-SAC
Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020
☆32Updated 3 years ago
facebookresearch / CollaQ
A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"
☆130Updated last year
watakandai / hiro_pytorch
Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)
☆110Updated 4 years ago
mengf1 / CHER
Curriculum-guided Hindsight Experience Replay (NeurIPS-2019)
☆65Updated 5 years ago
navneet-nmk / Hierarchical-Meta-Reinforcement-Learning
This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.
☆60Updated 6 years ago
watchernyu / REDQ
Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.
☆170Updated 8 months ago
qian18long / epciclr2020
☆121Updated 2 years ago
jatinarora2702 / gail-pytorch
PyTorch implementation of GAIL and PPO reinforcement learning algorithms
☆25Updated 4 years ago
RchalYang / torchrl
Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)
☆223Updated 3 years ago
Jingliang-Duan / DSAC-v1
DSAC; Distributional Soft Actor-Critic
☆129Updated 5 months ago
dhruvramani / Transformers-RL
An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"
☆180Updated 2 years ago
toshikwa / soft-actor-critic.pytorch
PyTorch implementation of Soft Actor-Critic(SAC).
☆103Updated 5 years ago
j3soon / dfac
[ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning
☆32Updated 2 years ago
lweitkamp / feudalnets-pytorch
PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.
☆41Updated 5 years ago
navuboy / gail_gym
Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.
☆89Updated 6 years ago
ling-pan / SD3
☆60Updated 4 years ago
adik993 / ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆142Updated 6 years ago
schatty / oprl
A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing
☆134Updated this week
toshikwa / gail-airl-ppo.pytorch
PyTorch implementation of GAIL and AIRL based on PPO.
☆223Updated 4 years ago
kristery / Imitation-Learning-from-Imperfect-Demonstration
[ICML 2019] Implementation of "Imitation Learning from Imperfect Demonstration"
☆50Updated 6 years ago
jqueeney / geppo
Generalized Proximal Policy Optimization with Sample Reuse (GePPO)
☆25Updated last year
skumar9876 / Hierarchical-DQN
Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https:/…
☆86Updated 7 years ago
lweitkamp / option-critic-pytorch
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
☆130Updated 11 months ago
polixir / OfflineRL
A collection of offline reinforcement learning algorithms.
☆189Updated 7 months ago