maywind23 / LSTM-RLLinks
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
☆15Updated 3 years ago
Alternatives and similar repositories for LSTM-RL
Users that are interested in LSTM-RL are comparing it to the libraries listed below
Sorting:
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization☆83Updated 2 years ago
 - Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆90Updated last year
 - Meta-RL Model-Based Algorithm☆40Updated 6 months ago
 - Pytorch Implementation of Policy Distillation for control, which has well-trained teachers via stable_baselines3.☆58Updated 4 years ago
 - Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL…☆55Updated 4 years ago
 - ☆14Updated 5 years ago
 - PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆43Updated 3 years ago
 - Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆39Updated 4 years ago
 - behavior cloning from observation☆36Updated 4 years ago
 - PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Updated 4 years ago
 - Deep Reinforcement Learning Framework done with PyTorch☆38Updated 7 months ago
 - Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆27Updated 5 years ago
 - A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data…☆39Updated last year
 - Official code of the paper "Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts" at ICLR2024☆34Updated 11 months ago
 - [NeurIPS 2024] Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow☆38Updated last year
 - Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆92Updated last year
 - Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆33Updated 4 years ago
 - multiagent-gail working with multiagent-particle-env-v2 (which was modified by magail authors)☆12Updated 6 years ago
 - A collection of recent MARL papers☆98Updated 11 months ago
 - Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"☆30Updated 2 years ago
 - Synthetic Experience Replay☆103Updated last year
 - PyTorch implementation of GAIL and AIRL based on PPO.☆227Updated 4 years ago
 - Codes for Paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control".☆28Updated 5 years ago
 - PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆88Updated 2 years ago
 - Clean baseline implementation of PPO using an episodic TransformerXL memory☆193Updated last year
 - MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆48Updated last year
 - Source files to replicate experiments in my ICLR 2022 paper.☆70Updated 3 months ago
 - Model-based Policy Gradients☆32Updated 5 years ago
 - DecentralizedLearning☆25Updated 2 years ago
 - PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆148Updated 3 years ago