yusme / LSPI

Least-Squares Policy Iteration

☆8

Alternatives and similar repositories for LSPI

Users who are interested in LSPI are comparing it to the libraries listed below.

akifumi-wachi-4 / safe_near_optimal_mdp
Safe Reinforcement Learning in Constrained Markov Decision Processes
☆60 Updated 4 years ago
MarcoMeter / recurrent-ppo-truncated-bptt
Baseline implementation of recurrent PPO using truncated BPTT
☆146 Updated last year
carolinewang01 / naht
Code repository for "N-agent Ad Hoc Teamwork" paper (Wang et al., NeurIPS 2024).
☆20 Updated 4 months ago
AgrawalAmey / safe-explorer
PyTorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]
☆72 Updated 6 years ago
uoe-agents / lb-foraging
Level-Based Foraging (LBF): A multi-agent reinforcement learning environment
☆46 Updated 8 months ago
TJU-DRL-LAB / self-supervised-rl
☆38 Updated 3 years ago
lucaslingle / pytorch_rl2
Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'
☆63 Updated 3 years ago
SapanaChaudhary / PyTorch-CPO
PyTorch implementation of Constrained Policy Optimization
☆54 Updated 3 years ago
Felhof / DiscreteSAC
☆40 Updated 3 years ago
zaiyan-x / RFQI
Implementation of Robust Reinforcement Learning using Offline Data [NeurIPS'22]
☆24 Updated 6 months ago
d3sm0 / gym_pomdp
Gym-like extensions for POMDP
☆57 Updated 4 years ago
mgerstgrasser / super
suPER is a collaborative multi-agent RL algorithm
☆13 Updated 11 months ago
ZiyuanMa / R2D2
An Implementation of Recurrent Experience Replay in Distributed Reinforcement Learning (Kapturowski et al. 2019) in PyTorch
☆45 Updated 2 years ago
mit-gfx / PGMORL
[ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control
☆113 Updated 4 years ago
rcheng805 / RL-CBF
☆153 Updated 6 years ago
watchernyu / REDQ
Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.
☆168 Updated 6 months ago
DesikRengarajan / EMRLD
[NeurIPS 2022] Code for Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments
☆12 Updated 2 years ago
jqueeney / robust-safe-rl
Robust and safe deep reinforcement learning algorithms
☆14 Updated last year
alirezakazemipour / SAC
Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.
☆27 Updated 3 weeks ago
sfujim / TD7
Author's PyTorch implementation of TD7 for online and offline RL
☆144 Updated last year
zbzhu99 / Constrained-Decision-Making-Paper-List
Paper list for constrained policy optimization in reinforcement learning.
☆72 Updated last year
vermouth1992 / safe_rl_papers
A list of safe reinforcement learning papers
☆20 Updated 5 years ago
zlr20 / saferl_kit
☆74 Updated last year
chauncygu / Safe-Multi-Agent-Mujoco
Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.
☆62 Updated 11 months ago
AlgTUDelft / WCSAC
Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"
☆55 Updated last year
akjayant / PPO_Lagrangian_PyTorch
Implementation of PPO Lagrangian in PyTorch
☆46 Updated 2 years ago
seolhokim / InverseRL-Pytorch
PyTorch implementations of GAIL, VAIL, AIRL, VAIRL, EAIRL, and SQIL
☆65 Updated 4 years ago
lweitkamp / feudalnets-pytorch
PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.
☆41 Updated 5 years ago
jakegrigsby / deep_control
Deep Reinforcement Learning for Continuous Control in PyTorch
☆100 Updated 3 years ago
CORE-Robotics-Lab / MAGIC
Public implementation of "Multi-Agent Graph-Attention Communication and Teaming" from AAMAS'21
☆84 Updated last year