datvodinh / ppo-transformer
A Reinforcement Learning Project using PPO + Transformer
☆28Updated last year
Related projects: ⓘ
- My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in Tensorflow.☆35Updated last year
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆130Updated 2 months ago
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆77Updated last year
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆137Updated 3 months ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆42Updated last year
- Prioritized Experience Replay implementation with proportional prioritization☆67Updated last year
- Multi-Agent Deep Reinforcement Learning by using Asynchronous & Impala Proximal Policy Optimization in Pytorch with some explanation☆32Updated 3 years ago
- Series of deep reinforcement learning algorithms 🤖☆29Updated 3 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆118Updated 4 months ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆48Updated 3 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆27Updated 3 weeks ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆161Updated last week
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆50Updated 11 months ago
- Datasets with baselines for offline multi-agent reinforcement learning.☆125Updated this week
- PyTorch implementation of FQF, IQN and QR-DQN.☆158Updated last month
- PyTorch implementation of discrete version of Soft Actor-Critic.☆27Updated 3 years ago
- A collection of pre-trained RL agents using Stable Baselines3☆102Updated last year
- TF2 Implementation of the Soft Actor-Critic Algorithm☆44Updated last year
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆41Updated 2 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆152Updated this week
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆21Updated last year
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆133Updated last month
- Minimal implementation of multi-agent reinforcement learning algorithms☆48Updated 3 years ago
- 🐳 Implementation of various Distributional Reinforcement Learning Algorithms using TensorFlow2.☆66Updated 3 years ago
- A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartP…☆99Updated 7 months ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆63Updated 5 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆128Updated 5 years ago
- PyTorch implementation of DDPG for continuous control tasks.☆41Updated 4 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆25Updated 4 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆92Updated 4 years ago
- Implementation for mSAC methods in PyTorch☆36Updated 2 years ago