maywind23 / LSTM-RL
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
☆10Updated 2 years ago
Alternatives and similar repositories for LSTM-RL
Users who are interested in LSTM-RL are comparing it to the repositories listed below.
- Generate expert demonstrations; GAIL (Generative Adversarial Imitation Learning); IRL (Inverse Reinforcement Learning)☆33Updated 3 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆54Updated 2 years ago
- Implementation of Recovery RL: Safe Reinforcement Learning With Learned Recovery Zones.☆57Updated last year
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆62Updated 11 months ago
- Code for the paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control".☆26Updated 5 years ago
- PyTorch implementation of policy distillation for control, with well-trained teachers provided via stable_baselines3.☆57Updated 3 years ago
- A novel Hierarchical Imitation Learning algorithm based on AIRL.☆22Updated last year
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"☆73Updated 3 months ago
- Official code for the paper "Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts" (ICLR 2024)☆20Updated 6 months ago
- Behavior cloning from observation☆34Updated 4 years ago
- This repository has code for the paper "Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algor…☆26Updated last year
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆75Updated last year
- Official code for "DHRL: A Graph-Based Approach for Long-Horizon and Sparse Hierarchical Reinforcement Learning" (NeurIPS 2022 Oral)☆27Updated 2 years ago
- Skill-based Model-based Reinforcement Learning (CoRL 2022)☆60Updated 2 years ago
- Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021☆34Updated 2 years ago
- DSAC; Distributional Soft Actor-Critic☆125Updated 3 months ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆85Updated last year
- Implementation of PPO Lagrangian in PyTorch☆45Updated 2 years ago
- (ICML 2023) The official code for RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolut…☆34Updated last year
- PyTorch implementation of Multi-Agent Generative Adversarial Imitation Learning☆40Updated 3 years ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆29Updated 5 months ago
- Code accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆74Updated 2 years ago
- 🔥 Datasets and env wrappers for offline safe reinforcement learning☆91Updated 8 months ago
- Author's PyTorch implementation of the ICLR 2023 paper "Behavior Proximal Policy Optimization" (BPPO).☆87Updated last year
- Robust and safe deep reinforcement learning algorithms☆14Updated last year
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆54Updated last year