ymzhang01 / focops
PyTorch implementation of First Order Constrained Optimization in Policy Space (FOCOPS).
☆29Updated 3 years ago
Alternatives and similar repositories for focops
Users who are interested in focops are comparing it to the libraries listed below
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆173Updated last year
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO☆180Updated 3 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆68Updated 2 years ago
- Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020☆45Updated 2 years ago
- Implementations of SAILR, PDO, and CSC☆31Updated last year
- Code for MOPO: Model-based Offline Policy Optimization☆189Updated 3 years ago
- Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan …☆71Updated 2 years ago
- Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'☆71Updated 3 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆153Updated 2 years ago
- ☆203Updated 2 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆81Updated 2 years ago
- Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning☆27Updated 2 years ago
- An implementation of Constrained Policy Optimization (Achiam 2017) in PyTorch☆26Updated 5 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆219Updated last year
- ☆57Updated 2 years ago
- Conservative Q Learning on top of SAC☆132Updated 3 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆151Updated 4 years ago
- 🚀 A fast safe reinforcement learning library in PyTorch☆223Updated last year
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 4 years ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆39Updated 8 months ago
- Author's PyTorch implementation of LAP and PAL with TD3 and DDQN☆37Updated 3 years ago
- Implementations of safe reinforcement learning algorithms☆29Updated last year
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO).☆92Updated last year
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Updated 2 years ago
- A PyTorch implementation of Implicit Q-Learning☆91Updated 4 years ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆38Updated 2 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆79Updated 3 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆139Updated last year
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆67Updated 2 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration☆58Updated 3 years ago