seolhokim / Mujoco-Pytorch
PPO, DDPG, and SAC implementations for MuJoCo environments
☆96 · Updated 3 years ago
Alternatives and similar repositories for Mujoco-Pytorch:
Users who are interested in Mujoco-Pytorch are comparing it to the libraries listed below.
- A simple implementation of Generative Adversarial Imitation Learning with PyTorch ☆146 · Updated 2 years ago
- ☆102 · Updated last week
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te… ☆164 · Updated last year
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL ☆340 · Updated 3 years ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments. ☆342 · Updated last year
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆157 · Updated 3 months ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆97 · Updated 3 years ago
- PyTorch implementation of Soft Actor-Critic (SAC) ☆524 · Updated 3 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co… ☆131 · Updated 9 months ago
- A clean and robust Pytorch implementation of PPO on Discrete action space ☆63 · Updated 8 months ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆127 · Updated last year
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch. ☆65 · Updated last year
- An elegant PyTorch offline reinforcement learning library for researchers. ☆302 · Updated 9 months ago
- NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms ☆339 · Updated 10 months ago
- PyTorch implementation of GAIL and AIRL based on PPO. ☆206 · Updated 4 years ago
- ☆88 · Updated last year
- ☆191 · Updated last year
- NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark ☆426 · Updated 9 months ago
- Implements the PPO algorithm on MuJoCo environments such as Ant-v2, Humanoid-v2, Hopper-v2, and HalfCheetah-v2. ☆49 · Updated 4 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space. ☆130 · Updated 8 months ago
- ☆247 · Updated 3 years ago
- DSAC; Distributional Soft Actor-Critic ☆123 · Updated this week
- The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta… ☆149 · Updated 7 months ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016 ☆122 · Updated 6 months ago
- This is the official implementation of Multi-Agent PPO. ☆102 · Updated 2 years ago
- ☆368 · Updated last year
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆159 · Updated 9 months ago
- Transformer in RL for decision-making ☆87 · Updated 2 years ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh… ☆147 · Updated last year
- A PyTorch implementation of Implicit Q-Learning ☆71 · Updated 3 years ago