datvodinh / ppo-transformer
A Reinforcement Learning Project using PPO + Transformer
☆32 · Updated last year
Related projects
Alternatives and complementary repositories for ppo-transformer
- Clean baseline implementation of PPO using an episodic TransformerXL memory ☆152 · Updated 5 months ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning ☆148 · Updated 4 months ago
- Multi-Agent Deep Reinforcement Learning using Asynchronous and IMPALA Proximal Policy Optimization in PyTorch, with some explanation ☆32 · Updated 4 years ago
- Baseline implementation of recurrent PPO using truncated BPTT ☆125 · Updated 6 months ago
- Prioritized Experience Replay implementation with proportional prioritization ☆69 · Updated last year
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,… ☆81 · Updated last year
- Deep Reinforcement Learning using Proximal Policy Optimization and Random Network Distillation in TensorFlow 2 and PyTorch with some e… ☆50 · Updated 3 years ago
- PPO with multi-head/autoregressive action outputs ☆36 · Updated 3 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing ☆132 · Updated 3 months ago
- Datasets with baselines for offline multi-agent reinforcement learning. ☆144 · Updated last week
- Implementation of Trajectory Transformer with attention caching and batched beam search ☆107 · Updated last year
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch ☆176 · Updated 2 months ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning" ☆170 · Updated last year
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO). ☆73 · Updated 11 months ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆43 · Updated last year
- Level-based Foraging (LBF): A multi-agent environment for RL ☆161 · Updated 2 months ago
- PyTorch implementation of FQF, IQN and QR-DQN. ☆161 · Updated 3 months ago
- Transformer in RL for decision-making ☆75 · Updated last year
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations. ☆94 · Updated 4 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm. ☆44 · Updated 3 years ago
- Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch. ☆66 · Updated 3 months ago
- My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in TensorFlow. ☆36 · Updated last year
- Deep Reinforcement Learning Framework done with PyTorch ☆30 · Updated this week
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation ☆72 · Updated 2 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆156 · Updated 2 years ago
- PyTorch implementation of Stochastic MuZero for Gym environments. This algorithm is capable of supporting a wide range of action and obser… ☆58 · Updated last year
- Distributed RL implementation using PyTorch and Ray (Ape-X, A3C, Distributed PPO (DPPO), IMPALA) ☆26 · Updated 2 years ago
- Partially Observable Process Gym ☆167 · Updated 4 months ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21) ☆79 · Updated last year
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) ☆133 · Updated 5 years ago