cor3bit / bertsekas-marl
PyTorch Implementation of the Sequential Multiagent Rollout algorithm
☆10Updated 10 months ago
Alternatives and similar repositories for bertsekas-marl
Users who are interested in bertsekas-marl are comparing it to the libraries listed below.
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023☆31Updated 5 months ago
- PyTorch implementation of First Order Constrained Optimization in Policy Space (FOCOPS).☆26Updated 3 years ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆33Updated 2 years ago
- ☆31Updated 2 years ago
- Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan …☆70Updated 2 years ago
- ☆30Updated last year
- Implementation of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆65Updated 11 months ago
- Implementation of "Actor-Critic Alignment for Offline-to-Online Reinforcement Learning"☆14Updated last year
- PyTorch implementation of "Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs", NeurIPS 2020☆41Updated 4 years ago
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL☆42Updated 8 months ago
- ☆12Updated 2 years ago
- Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning☆24Updated last year
- Code for FOCAL Paper Published at ICLR 2021☆51Updated last year
- Official repository of the paper "FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning"☆25Updated 9 months ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆48Updated 10 months ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆35Updated 4 years ago
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.☆24Updated 2 years ago
- Conservative Q-learning in JAX☆54Updated 2 years ago
- ☆11Updated last year
- Robust Reinforcement Learning with the Alternating Training of Learned Adversaries (ATLA) framework☆64Updated 4 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆28Updated 3 years ago
- Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL☆25Updated 3 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆58Updated 2 years ago
- ☆11Updated last year
- Code for "On the Robustness of Safe Reinforcement Learning under Observational Perturbations" (ICLR 2023)☆46Updated 5 months ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Updated 2 years ago
- Mirror Descent Policy Optimization☆38Updated 4 years ago
- Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning☆17Updated 2 years ago
- Anti-exploration in offline reinforcement learning☆9Updated 4 years ago
- ☆10Updated 2 years ago