mhyrzt / Simple-MADRL-Chess
MADRL project solving chess environment using PPO with two different methods: 2 agents/networks and a single agent/network.
☆18Updated 2 years ago
Alternatives and similar repositories for Simple-MADRL-Chess
Users that are interested in Simple-MADRL-Chess are comparing it to the libraries listed below
Sorting:
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆28Updated 2 years ago
- Implicit Distributional Actor Critic☆11Updated 3 years ago
- Automatic code generator for training Reinforcement Learning policies☆11Updated 4 years ago
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…☆11Updated last year
- The official implementation of Memory-efficient DQN algorithm.☆10Updated last year
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated last year
- Contains implementation of the FILTER algorithm for exponentially faster inverse reinforcement learning.☆49Updated 2 years ago
- Code for "Continuous-Time Meta-Learning with Forward Mode Differentiation" (ICLR 2022)☆26Updated 3 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆28Updated this week
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Updated 3 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆21Updated 3 years ago
- Write simple games in Numpy!☆12Updated 2 years ago
- Simple implementations of multi-agent evolutionary strategies using pytorch.☆16Updated 3 years ago
- ☆12Updated 4 years ago
- Multi-agent active perception with prediction rewards☆12Updated 4 years ago
- ☆8Updated 4 years ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆12Updated last year
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆78Updated last year
- Actor-Sharer-Learner training framework for off-policy DRL algorithms☆20Updated 4 months ago
- Implementation of NeurIPS 2018 paper "Meta-Gradient Reinforcement Learning"☆20Updated 2 years ago
- The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025☆9Updated 7 months ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Updated 4 years ago
- Code for paper "Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning".☆14Updated 3 years ago
- ☆16Updated 4 years ago
- PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning. AAMAS 2024 (full paper with oral presenta…☆10Updated last year
- This is the code of reproducing the results of our paper: On the importance of Hyperparameter Optimization for Model-based Reinforcement …☆15Updated 3 years ago
- Optim4RL is a Jax framework of learning to optimize for reinforcement learning.☆25Updated 5 months ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆38Updated 4 years ago
- Modular Single-file Reinfocement Learning Algorithms Library☆37Updated 2 years ago
- Information Design in Multi-Agent Reinforcement Learning☆14Updated last year