mhyrzt / Simple-MADRL-ChessLinks
MADRL project solving chess environment using PPO with two different methods: 2 agents/networks and a single agent/network.
☆18Updated 2 years ago
Alternatives and similar repositories for Simple-MADRL-Chess
Users that are interested in Simple-MADRL-Chess are comparing it to the libraries listed below
Sorting:
- OpenAi gym environment for the Rubik's Cube (3x3x3).☆13Updated 3 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆16Updated 3 years ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆83Updated last year
- A web based platform for collecting human actions in reinforcement learning environments☆31Updated 2 years ago
- Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch fra…☆140Updated 2 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆29Updated 3 months ago
- Open source code for paper "Denoised MDPs: Learning World Models Better Than the World Itself"☆134Updated 2 years ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated 2 years ago
- Pytorch implementation of the Deep Deterministic Policy Gradients for Continuous Control☆26Updated 2 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆28Updated 2 years ago
- The code for experiments conducted to verify the correctness of mirror learning.☆11Updated 3 years ago
- Pytorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Re…☆109Updated 3 years ago
- Contains implementation of the FILTER algorithm for exponentially faster inverse reinforcement learning.☆50Updated 2 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Updated last month
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆31Updated 4 years ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆32Updated 4 years ago
- ☆37Updated 2 years ago
- ☆31Updated 4 years ago
- Summary of key papers in deep reinforcement learning. Heavily based on OpenAI SpinningUp.☆82Updated 5 years ago
- Minimal code for A Generalist Agent☆42Updated 2 years ago
- Keras implementation of DQN for the MsPacman-v0 OpenAI Gym environment.☆37Updated 2 years ago
- Deep Q-Network (DQN) and Fitted Q-Iteration (FQI) tutorial for RL Summer School 2023☆76Updated 10 months ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Updated last year
- Official repository for our paper on "Action Inference by Maximising Evidence: Zero-Shot Imitation from Observation with World Models"☆13Updated last year
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆76Updated last year
- All the source codes and lectures of reinforcement learning.☆32Updated 5 years ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆97Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆61Updated last year
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL☆29Updated last year
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021☆16Updated 4 years ago