maywind23 / LSTM-RL

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

☆10

Alternatives and similar repositories for LSTM-RL

Users who are interested in LSTM-RL are comparing it to the repositories listed below.


Johnny-Zhang92 / IRL-Essential-Code
Generate expert demonstrations; GAIL (Generative Adversarial Imitation Learning); IRL (Inverse Reinforcement Learning)
☆33 Updated 3 years ago
Haichao-Zhang / PEX
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)
☆54 Updated 2 years ago
abalakrishna123 / recovery-rl
Implementation of Recovery RL: Safe Reinforcement Learning With Learned Recovery Zones.
☆57 Updated last year
chauncygu / Safe-Multi-Agent-Mujoco
Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.
☆62 Updated 11 months ago
baimingc / delay-aware-MBRL
Code for the paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control".
☆26 Updated 5 years ago
CUN-bjy / policy-distillation-baselines
PyTorch implementation of policy distillation for control, with well-trained teachers provided via stable_baselines3.
☆57 Updated 3 years ago
LucasCJYSDL / HierAIRL
A novel Hierarchical Imitation Learning algorithm based on AIRL.
☆22 Updated last year
zbzhu99 / madiff
Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"
☆73 Updated 3 months ago
AhmedMagdyHendawy / MOORE
Official code for the paper "Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts" at ICLR 2024
☆20 Updated 6 months ago
montaserFath / BCO
Behavior cloning from observation
☆34 Updated 4 years ago
intelligent-control-lab / guard
☆49 Updated 3 months ago
TobiasLv / RAD
☆46 Updated last month
akjayant / mbppol
This repository has code for the paper "Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algor…
☆26 Updated last year
liuzuxin / cvpo-safe-rl
Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)
☆75 Updated last year
jayLEE0301 / dhrl_official
Official code for "DHRL: A Graph-Based Approach for Long-Horizon and Sparse Hierarchical Reinforcement Learning" (NeurIPS 2022 Oral)
☆27 Updated 2 years ago
clvrai / skimo
Skill-based Model-based Reinforcement Learning (CoRL 2022)
☆60 Updated 2 years ago
martius-lab / HiTS
Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021
☆34 Updated 2 years ago
zlr20 / saferl_kit
☆72 Updated last year
Jingliang-Duan / DSAC-v1
DSAC; Distributional Soft Actor-Critic
☆125 Updated 3 months ago
zhihanyang2022 / off-policy-continuous-control
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
☆85 Updated last year
akjayant / PPO_Lagrangian_PyTorch
Implementation of PPO Lagrangian in PyTorch
☆45 Updated 2 years ago
yeshenpy / RACE
(ICML 2023) The official code for RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolut…
☆34 Updated last year
RITCHIEHuang / MAGAIL
PyTorch implementation of Multi-Agent Generative Adversarial Imitation Learning
☆40 Updated 3 years ago
xihuai18 / A2PO-ICLR2023
Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)
☆29 Updated 5 months ago
MDrW / ICML2022-IRAT
☆39 Updated 2 years ago
YiqinYang / ICQ
Code accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…
☆74 Updated 2 years ago
liuzuxin / DSRL
🔥 Datasets and env wrappers for offline safe reinforcement learning
☆91 Updated 8 months ago
Dragon-Zhuang / BPPO
Author's PyTorch implementation of the ICLR 2023 paper "Behavior Proximal Policy Optimization" (BPPO).
☆87 Updated last year
jqueeney / robust-safe-rl
Robust and safe deep reinforcement learning algorithms
☆14 Updated last year
AlgTUDelft / WCSAC
Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"
☆54 Updated last year