yashbonde / Transformer-RLLinks

Experiments to train transformer network to master reinforcement learning environments.

☆32

Alternatives and similar repositories for Transformer-RL

Users that are interested in Transformer-RL are comparing it to the libraries listed below

Sorting:

danielwillemsen / MAMBPO
DecentralizedLearning
☆24Updated 2 years ago
alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆51Updated 2 months ago
quantumiracle / MARS
MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.
☆49Updated last year
chauncygu / Safe-Multi-Agent-Mujoco
Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.
☆64Updated last year
thu-rllab / CFCQL
Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.
☆37Updated 5 months ago
jqueeney / geppo
Generalized Proximal Policy Optimization with Sample Reuse (GePPO)
☆25Updated 2 years ago
TonghanWang / DOP
Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)
☆51Updated 2 years ago
acyclics / MPO
Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments
☆27Updated 4 years ago
jesbu1 / hidio
Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options
☆46Updated 3 years ago
Improbable-AI / eipo
Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization
☆82Updated 2 years ago
uoe-agents / derl
The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)
☆27Updated 3 years ago
daniellawson9999 / online-decision-transformer
An unofficial implementation for online decision transformer
☆40Updated 2 years ago
rmst / rlrd
PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)
☆41Updated 3 years ago
eugenevinitsky / robust_RL_multi_adversary
We investigate the effect of populations on finding good solutions to the robust MDP
☆28Updated 4 years ago
tesslerc / ActionRobustRL
Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918…
☆46Updated 6 years ago
morning9393 / Optimal-Baseline-for-Multi-agent-Policy-Gradients
☆28Updated 3 years ago
BlueFisher / Advanced-Soft-Actor-Critic
Soft Actor-Critic with advanced features
☆50Updated this week
quantumiracle / nash-dqn
Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…
☆20Updated 2 years ago
zhihanyang2022 / off-policy-continuous-control
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
☆88Updated last year
sfujim / LAP-PAL
Author's PyTorch implementation of LAP and PAL with TD3 and DDQN
☆36Updated 3 years ago
SvenGronauer / Bullet-Safety-Gym
An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.
☆70Updated 2 years ago
navneet-nmk / Hierarchical-Meta-Reinforcement-Learning
This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.
☆60Updated 6 years ago
sisl / DICG
Deep Implicit Coordination Graphs
☆41Updated last year
twni2016 / Memory-RL
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)
☆63Updated last year
apexrl / CoDAIL
Implementation of CoDAIL in the ICLR 2020 paper <Multi-Agent Interactions Modeling with Correlated Policies>
☆18Updated 4 years ago
sparkmxy / my-offlinerl
☆24Updated 3 years ago
junjungoal / IMPALA-pytorch
PyTorch IMPALA implementation
☆27Updated 5 years ago
kpaonaut / HAAR-A-Hierarchical-RL-Algorithm
Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards
☆32Updated 2 years ago
wendelinboehmer / dcg
☆75Updated last year
alversafa / option-critic-arch
Implementation of the Option-Critic Architecture
☆40Updated 6 years ago