A novel parallel UCT algorithm with linear speedup and negligible performance loss.
☆122Apr 26, 2021Updated 5 years ago
Alternatives and similar repositories for WU-UCT
Users that are interested in WU-UCT are comparing it to the libraries listed below.
- ☆22May 5, 2021Updated 5 years ago
- GPU Monte Carlo Tree Search with MPI☆26Jan 9, 2019Updated 7 years ago
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"☆12Jul 12, 2021Updated 4 years ago
- Upper Confidence Tree Planner for ATARI games☆19Mar 9, 2016Updated 10 years ago
- Monte Carlo Counterfactual Regret Minimization for imperfect information games☆13Mar 29, 2019Updated 7 years ago
- Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…☆11Dec 1, 2022Updated 3 years ago
- Code for "AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms", AAAI 2022 (Oral)☆22Apr 22, 2024Updated 2 years ago
- A curated list of Monte Carlo tree search papers with implementations.☆702Jan 13, 2026Updated 4 months ago
- Codes for Stackelberg GAN☆15Apr 23, 2019Updated 7 years ago
- Implementation of Counterfactual risk minimization☆26Apr 13, 2017Updated 9 years ago
- General Python implementation of Monte Carlo Tree Search for use with OpenAI Gym environments.☆42Oct 8, 2020Updated 5 years ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆27May 2, 2025Updated last year
- A PyTorch implementation of DeepMind's MCTSnet☆18Dec 8, 2022Updated 3 years ago
- Monte Carlo Tree Search with Reinforcement Learning for Motion Planning☆79Sep 23, 2020Updated 5 years ago
- ☆11Jun 30, 2020Updated 5 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆121Jul 25, 2024Updated last year
- Code to accompany "Human-Level Performance in No-Press Diplomacy via Equilibrium Search", published at ICLR 2021☆51Aug 27, 2022Updated 3 years ago
- ☆13Sep 14, 2021Updated 4 years ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 3 years ago
- Parallel Monte Carlo Tree Search with Batched Rigid-body Simulations☆31Aug 9, 2024Updated last year
- Mixed Sum-Product Networks: A Deep Architecture for Hybrid Domains☆16May 12, 2018Updated 8 years ago
- HyP-DESPOT: A Hybrid Parallel Algorithm for Online Planning under Uncertainty☆56Jan 4, 2024Updated 2 years ago
- Tensor Belief Propagation - algorithm for approximate inference in discrete graphical models☆12Feb 17, 2020Updated 6 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms☆97Dec 8, 2020Updated 5 years ago
- nonlinear solver for the constrained problem☆21Sep 18, 2023Updated 2 years ago
- Potential-Aware Imperfect-Recall Abstraction with Earth Mover’s Distance in Imperfect-Information Games☆16Nov 29, 2025Updated 6 months ago
- A reinforcement learning based solver for combinatorial problems☆43Jun 22, 2022Updated 3 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆54May 15, 2019Updated 7 years ago
- MetaLight: a value-based meta-reinforcement learning framework for traffic signal control☆45Jan 13, 2020Updated 6 years ago
- Distributed Deep Reinforcement Learning☆30Jan 21, 2021Updated 5 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆18Mar 2, 2021Updated 5 years ago
- A Multi-threaded Implementation of AlphaZero (C++)☆387Jan 7, 2023Updated 3 years ago
- Compares the theoretical attention gradient with the PyTorch attention gradient☆16Apr 1, 2024Updated 2 years ago
- ☆15Nov 22, 2019Updated 6 years ago
- A PyTorch wrapper of parallel exclusive scan in CUDA☆12May 25, 2023Updated 3 years ago
- Counterfactual Regret Minimization (CFR) sample code in Python☆14Apr 16, 2019Updated 7 years ago
- Code for "Efficient optimization of loops and limits with randomized telescoping sums"☆29May 13, 2019Updated 7 years ago
- ☆10Feb 28, 2019Updated 7 years ago
- [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCT…☆1,591May 12, 2026Updated 2 weeks ago