CyCTW / Parallel-MCTS
Parallel Monte Carlo Tree Search. See README.md for more detailed usage and information.
☆46 Updated 4 years ago
Alternatives and similar repositories for Parallel-MCTS
Users who are interested in Parallel-MCTS are comparing it to the libraries listed below.
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆118 Updated 4 years ago
- Multi-Agent Deep Reinforcement Learning by using Asynchronous & Impala Proximal Policy Optimization in Pytorch with some explanation ☆36 Updated 4 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆49 Updated 2 years ago
- General Python implementation of Monte Carlo Tree Search for use with OpenAI Gym environments. ☆40 Updated 4 years ago
- Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291. ☆106 Updated 2 years ago
- Deep Reinforcement Learning Framework done with PyTorch ☆36 Updated 2 months ago
- MARS is short for Multi-Agent Research Studio, a library for multi-agent reinforcement learning research. ☆48 Updated last year
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning ☆49 Updated 3 years ago
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation ☆75 Updated 2 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games ☆50 Updated 8 months ago
- DecentralizedLearning ☆24 Updated 2 years ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games. ☆28 Updated 3 years ago
- Model-based Policy Gradients ☆31 Updated 5 years ago
- An unofficial implementation for online decision transformer ☆40 Updated 2 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t… ☆44 Updated 4 years ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code… ☆27 Updated 2 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala) ☆28 Updated 2 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322) ☆52 Updated 2 years ago
- Collection of OpenAI parametrized action-space environments. ☆65 Updated last month
- ☆12 Updated last year
- Multi-Objective Reinforcement Learning components built on top of RL glue components ☆29 Updated 2 years ago
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments ☆27 Updated 4 years ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020) ☆40 Updated 2 years ago
- Author's PyTorch implementation of LAP and PAL with TD3 and DDQN ☆34 Updated 3 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration ☆68 Updated 3 years ago
- Prioritized Experience Replay implementation with proportional prioritization ☆77 Updated last year
- Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020 ☆45 Updated last year
- 🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation ☆38 Updated 3 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆167 Updated 3 years ago
- DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details ☆46 Updated 3 years ago