MyRepositories-hub / Simple-Policy-Optimization
☆59 · Updated 2 months ago
Alternatives and similar repositories for Simple-Policy-Optimization:
Users who are interested in Simple-Policy-Optimization are comparing it to the repositories listed below.
- ☆39 · Updated 3 weeks ago
- Implementation of SAC and TD3 based on various RNN and Transformer architectures. ☆20 · Updated 6 months ago
- NeurIPS 2024 DACER ☆101 · Updated this week
- ☆102 · Updated 2 months ago
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO). ☆82 · Updated last year
- rl-papers ☆47 · Updated 2 years ago
- DSAC; Distributional Soft Actor-Critic ☆125 · Updated 2 months ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning" ☆55 · Updated last year
- ☆96 · Updated last year
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch. ☆25 · Updated 3 weeks ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆106 · Updated 3 years ago
- Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models" ☆68 · Updated 3 months ago
- Code for running RL experiments on continuing (non-episodic) problems. ☆17 · Updated last week
- Robust and safe deep reinforcement learning algorithms ☆13 · Updated last year
- [ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model" ☆95 · Updated 2 months ago
- Solve BipedalWalkerHardcore-v2 with TD3 ☆88 · Updated last year
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research. ☆61 · Updated 10 months ago
- ☆29 · Updated last year
- ☆42 · Updated 3 years ago
- Generate expert demonstrations; GAIL (Generative Adversarial Imitation Learning); IRL (Inverse Reinforcement Learning) ☆33 · Updated 3 years ago
- ☆21 · Updated 8 months ago
- Implementation of the ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data". ☆40 · Updated 5 months ago
- Distributed RL implementation using PyTorch and Ray (ApeX (Ape-X), A3C, Distributed-PPO (DPPO), Impala) ☆27 · Updated 2 years ago
- ☆23 · Updated 2 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR 2023) ☆53 · Updated 2 years ago
- A novel Hierarchical Imitation Learning algorithm based on AIRL. ☆22 · Updated last year
- Transformer in RL for decision-making ☆98 · Updated 2 years ago
- A PyTorch re-implementation of Model-based Offline Policy Optimization ☆31 · Updated last year
- Model-Free Safe Reinforcement Learning through Neural Barrier Certificate ☆33 · Updated 11 months ago
- MATE: the Multi-Agent Tracking Environment. ☆44 · Updated 2 years ago