baturaysaglam / LA3P
Actor Prioritized Experience Replay
☆11Updated 10 months ago
Related projects:
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆13Updated 3 years ago
- ☆28Updated last year
- Implementation of the ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆31Updated 2 months ago
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.☆21Updated 7 months ago
- Meta RL codebase for Unstable Baselines☆20Updated last year
- ☆12Updated 3 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆45Updated 3 months ago
- Author's PyTorch implementation of the ICLR 2023 paper "Behavior Proximal Policy Optimization" (BPPO).☆70Updated 9 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆108Updated last year
- ☆44Updated 3 years ago
- ☆28Updated 3 years ago
- Solving ML10☆11Updated 10 months ago
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat…☆27Updated 6 months ago
- A full PyTorch re-implementation of Model-based Offline Policy Optimization☆27Updated last year
- A PyTorch implementation of Implicit Q-Learning☆66Updated 2 years ago
- Distributed RL implementations using PyTorch and Ray (Ape-X, A3C, Distributed PPO (DPPO), IMPALA)☆27Updated 2 years ago
- Code for the NeurIPS 2022 paper "Robust Offline Reinforcement Learning via Conservative Smoothing"☆17Updated last year
- ☆20Updated this week
- ☆51Updated last year
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆83Updated last year
- Code for the paper "Offline Prioritized Experience Replay"☆13Updated last year
- ELIGN: Expectation Alignment as a Multi-agent Intrinsic Reward☆19Updated last year
- ☆18Updated 7 months ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆54Updated 8 months ago
- Google Research Football MARL Benchmark and Research Toolkit☆28Updated 4 months ago
- 🔥 Datasets and env wrappers for offline safe reinforcement learning☆65Updated last week
- RL Algorithms for Visual Continuous Control☆30Updated last year
- Distributional Soft Actor Critic☆49Updated 4 years ago
- Revisiting Discrete Gradient Estimation in MADDPG☆23Updated last year
- PyTorch implementation of the implicit Q-learning algorithm (IQL)☆41Updated 2 years ago