cor3bit / bertsekas-marl
PyTorch Implementation of the Sequential Multiagent Rollout algorithm
☆10Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for bertsekas-marl
- Pytorch Implementation for First Order Constrained Optimization in Policy Space (FOCOPS).☆25Updated 2 years ago
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL☆35Updated 2 months ago
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023☆29Updated last year
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆32Updated last year
- Implementation of Robust Reinforcement Learning using Offline Data [NeurIPS'22]☆22Updated 2 weeks ago
- Code for FOCAL Paper Published at ICLR 2021☆49Updated 11 months ago
- Representation Learning for RL☆119Updated last year
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Updated 2 years ago
- ☆29Updated last year
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆33Updated 3 years ago
- Code for "On the Robustness of Safe Reinforcement Learning under Observational Perturbations" (ICLR 2023)☆41Updated last year
- Simple maze environments using mujoco-py☆52Updated 10 months ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆62Updated last year
- Codebase for Prioritizing samples in Reinforcement Learning with Reducible Loss☆10Updated 2 years ago
- ☆26Updated 2 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆73Updated 11 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆116Updated last year
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆156Updated 2 years ago
- Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning☆23Updated last year
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆28Updated 2 years ago
- ☆30Updated last year
- Conservative Q learning in Jax☆51Updated last year
- ☆52Updated last year
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆55Updated 10 months ago
- Code for Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations☆18Updated 2 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆54Updated last year
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆40Updated 2 months ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆25Updated last year
- PyTorch implementation of "Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs", NeurIPS 2020☆38Updated 4 years ago