raincchio / P3OLinks
☆11Updated last year
Alternatives and similar repositories for P3O
Users that are interested in P3O are comparing it to the libraries listed below
Sorting:
- Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022)☆10Updated 2 years ago
- ☆12Updated 4 years ago
- [AAMAS 2023] Code for the paper "Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning"☆13Updated last year
- Code for NeurIPS paper "Self-Organized Group for Cooperative Multi-agentReinforcement Learning".☆21Updated 2 years ago
- Multi-task Multi-agent Soft Actor Critic for SMAC☆12Updated 3 years ago
- ☆20Updated 2 years ago
- ♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation☆10Updated 4 years ago
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.☆14Updated last year
- Revisiting Discrete Gradient Estimation in MADDPG☆24Updated 2 years ago
- Code for Efficient Continuous Control with Double Actors and Regularized Critics, AAAI 2022.☆21Updated 3 years ago
- This is the code for Q-value Path Decomposition for Deep Multiagent Reinforcement Learning (NeurIPS 2019).☆11Updated 6 years ago
- ☆12Updated 2 years ago
- TransMix: Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning☆10Updated 2 years ago
- ☆11Updated 4 years ago
- Delayed RL agent for non-Atari tasks, from "Acting in Delayed Environments with Non-Stationary Markov Policies", ICLR 2021.☆14Updated last year
- Represented Value Function Approach for Large Scale Multi Agent Reinforcement Learning☆15Updated 5 years ago
- ☆11Updated 3 years ago
- ☆12Updated 4 years ago
- Resilient Multi-Agent Reinforcement Learning☆10Updated 2 years ago
- ☆25Updated 3 years ago
- CAM ,DRL, Gated ,Multi-Attention☆9Updated 3 years ago
- Deep learning implementations (Asynchronous Deep Q-Learning) of multiple Game Theory algorithms for adversarial learning (WoLF-PHC, GIGA-…☆15Updated 7 years ago
- Evolution-based Soft Actor-Critic (ESAC)☆42Updated 11 months ago
- Implementation of NeurIPS2021 paper <On Effective Scheduling of Model-based Reinforcement Learning>☆13Updated 3 years ago
- ☆28Updated 3 years ago
- Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…☆13Updated 3 years ago
- Experimenting with meta-learning approaches to opponent modelling in MARL. Building upon previous public implementations of MADDPG and M3…☆15Updated 3 years ago
- ☆18Updated 4 years ago
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆20Updated 3 years ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆29Updated 6 months ago