yusme / LSPI
Least-Squares Policy Iteration
☆8 · Updated 2 years ago
Alternatives and similar repositories for LSPI:
Users who are interested in LSPI are comparing it to the repositories listed below.
- Safe Reinforcement Learning in Constrained Markov Decision Processes ☆59 · Updated 4 years ago
- PyTorch implementation of Constrained Policy Optimization ☆53 · Updated 3 years ago
- Code repository for "N-agent Ad Hoc Teamwork" paper (Wang et al., NeurIPS 2024). ☆18 · Updated 3 months ago
- (no description) ☆27 · Updated 4 years ago
- PyTorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.] ☆72 · Updated 5 years ago
- PyTorch implementation of GAIL, VAIL, AIRL, VAIRL, EAIRL, and SQIL ☆65 · Updated 3 years ago
- (no description) ☆42 · Updated 3 years ago
- PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm ☆47 · Updated 2 years ago
- Code for the paper "Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring Statewise Safety" ☆20 · Updated 2 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning" ☆54 · Updated last year
- (no description) ☆20 · Updated last year
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning ☆33 · Updated 5 years ago
- Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning' ☆63 · Updated 3 years ago
- A collection of Meta-Reinforcement Learning algorithms in PyTorch ☆40 · Updated 9 months ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆166 · Updated last year
- (no description) ☆44 · Updated 4 years ago
- (no description) ☆37 · Updated 3 years ago
- Implementation of PPO Lagrangian in PyTorch ☆44 · Updated 2 years ago
- Code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination' (AAAI 2020) ☆59 · Updated 4 years ago
- Distributional Soft Actor Critic ☆52 · Updated 4 years ago
- Code for a model-based version of Constrained Policy Optimization ☆10 · Updated 3 years ago
- (no description) ☆17 · Updated 4 years ago
- (no description) ☆40 · Updated 3 years ago
- (no description) ☆49 · Updated 3 years ago
- Simple implementation for Constrained Policy Optimization in PyTorch ☆15 · Updated 2 years ago
- Transformer in RL for decision-making ☆98 · Updated 2 years ago
- JAX and Torch Multi-Agent SAC on the PettingZoo API ☆77 · Updated 5 months ago
- Constrained Policy Optimization implementation on Safety Gym ☆25 · Updated 3 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm. ☆51 · Updated 3 years ago
- Revisiting Discrete Gradient Estimation in MADDPG ☆24 · Updated 2 years ago