A novel parallel UCT algorithm with linear speedup and negligible performance loss.
☆122Apr 26, 2021Updated 4 years ago
Alternatives and similar repositories for WU-UCT
Users that are interested in WU-UCT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Parallel Monte Carlo Tree Search, see README.md for more detailed usage and information.☆50Jan 10, 2021Updated 5 years ago
- GPU Monte Carlo Tree Search with MPI☆26Jan 9, 2019Updated 7 years ago
- Monte Carlo Conterfactual Regret Minimization for imperfect information games☆13Mar 29, 2019Updated 7 years ago
- Demo of UCT (MCTS) in Python / Numpy☆88Dec 23, 2022Updated 3 years ago
- Lossless compression using Probabilistic Circuits☆16Mar 10, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…☆11Dec 1, 2022Updated 3 years ago
- Code for "AutoCFR: Learning to Design Counterfatual Regret Minimization Algorithms", AAAI 2022 (Oral)☆22Apr 22, 2024Updated last year
- A curated list of Monte Carlo tree search papers with implementations.☆698Jan 13, 2026Updated 3 months ago
- Codes for Stackelberg GAN☆15Apr 23, 2019Updated 6 years ago
- Implementation of Counterfactual risk minimization☆26Apr 13, 2017Updated 9 years ago
- General Python implementation of Monte Carlo Tree Search for the use with Open AI Gym environments.☆42Oct 8, 2020Updated 5 years ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆26May 2, 2025Updated 11 months ago
- A PyTorch implementation of DeepMind's MCTSnet☆18Dec 8, 2022Updated 3 years ago
- (CoRL 2019 Spotlight) Asynchronous Methods for Model-Based Reinforcement Learning☆14Dec 27, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆120Jul 25, 2024Updated last year
- Pytorch implementation of distributed deep reinforcement learning☆76Jul 4, 2022Updated 3 years ago
- Code to accompany "Human-Level Performance in No-Press Diplomacy via Equilibrium Search", published at ICLR 2021☆51Aug 27, 2022Updated 3 years ago
- ☆13Sep 14, 2021Updated 4 years ago
- Code for our paper: "Regularity Normalization: Neuroscience-Inspired Unsupervised Attention across Neural Network Layers".☆20Dec 28, 2021Updated 4 years ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 2 years ago
- Parallel Monte Carlo Tree Search with Batched Rigid-body Simulations☆31Aug 9, 2024Updated last year
- Tensor Belief Propagation - algorithm for approximate inference in discrete graphical models☆12Feb 17, 2020Updated 6 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms☆97Dec 8, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Potential-Aware Imperfect-Recall Abstraction with Earth Mover’s Distance in Imperfect-Information Games☆16Nov 29, 2025Updated 4 months ago
- A reinforcement learning based solver for combinatorial problems☆43Jun 22, 2022Updated 3 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆54May 15, 2019Updated 6 years ago
- Implementation of MCTS algorithms in Munos (2014)☆13Aug 8, 2018Updated 7 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆22Oct 26, 2018Updated 7 years ago
- MetaLight: a value-based meta-reinforcement learning framework for traffic signal control☆44Jan 13, 2020Updated 6 years ago
- Distributed Deep Reinforcement Learning☆30Jan 21, 2021Updated 5 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆18Mar 2, 2021Updated 5 years ago
- A Multi-threaded Implementation of AlphaZero (C++)☆388Jan 7, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆15Nov 22, 2019Updated 6 years ago
- This repository implements Distilled Graph Attention Policy Networks (DGAPNs), a curiosity-driven reinforcement learning model to generat…☆21Jan 21, 2022Updated 4 years ago
- A PyTorch wrapper of parallel exclusive scan in CUDA☆12May 25, 2023Updated 2 years ago
- Counterfactual Regret Minimization (CFR) sample code in Python☆14Apr 16, 2019Updated 7 years ago
- Code for "Efficient optimization of loops and limits with randomized telescoping sums"☆28May 13, 2019Updated 6 years ago
- ☆10Feb 28, 2019Updated 7 years ago
- Implementation of SPW and DPW for Monte Carlo Tree Search in Continuous action/state space☆20Oct 3, 2023Updated 2 years ago