liuanji / WU-UCTView external linksLinks
A novel parallel UCT algorithm with linear speedup and negligible performance loss.
☆122Apr 26, 2021Updated 4 years ago
Alternatives and similar repositories for WU-UCT
Users that are interested in WU-UCT are comparing it to the libraries listed below
Sorting:
- ☆22May 5, 2021Updated 4 years ago
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"☆12Jul 12, 2021Updated 4 years ago
- Monte Carlo Conterfactual Regret Minimization for imperfect information games☆13Mar 29, 2019Updated 6 years ago
- Demo of UCT (MCTS) in Python / Numpy☆88Dec 23, 2022Updated 3 years ago
- Codes for Stackelberg GAN☆15Apr 23, 2019Updated 6 years ago
- Upper Confidence Tree Planner for ATARI games☆19Mar 9, 2016Updated 9 years ago
- Implementation of Counterfactual risk minimization☆26Apr 13, 2017Updated 8 years ago
- Code for "AutoCFR: Learning to Design Counterfatual Regret Minimization Algorithms", AAAI 2022 (Oral)☆22Apr 22, 2024Updated last year
- A PyTorch implementation of DeepMind's MCTSnet☆18Dec 8, 2022Updated 3 years ago
- Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…☆11Dec 1, 2022Updated 3 years ago
- code for the paper "Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation" (TPAMI 2021)☆10Jul 15, 2022Updated 3 years ago
- ☆10Feb 28, 2019Updated 6 years ago
- A curated list of Monte Carlo tree search papers with implementations.☆690Jan 13, 2026Updated last month
- Tensor Belief Propagation - algorithm for approximate inference in discrete graphical models☆12Feb 17, 2020Updated 5 years ago
- Python 3.6 and TensorFlow implementation of the AReS and MaRS algorithms☆11Jun 23, 2019Updated 6 years ago
- Code for "Efficient optimization of loops and limits with randomized telescoping sums"☆27May 13, 2019Updated 6 years ago
- Code to accompany "Human-Level Performance in No-Press Diplomacy via Equilibrium Search", published at ICLR 2021☆51Aug 27, 2022Updated 3 years ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 2 years ago
- A Python module for generating fast bilinear algorithms for different convolution algorithms☆16Feb 29, 2024Updated last year
- Inference on non-linear dynamical systems written in JAX☆11Aug 20, 2020Updated 5 years ago
- ☆11Jun 30, 2020Updated 5 years ago
- Distribution and filtering on SO(3) x Euclidean space☆12Jun 24, 2022Updated 3 years ago
- Pytorch implementation of distributed deep reinforcement learning☆76Jul 4, 2022Updated 3 years ago
- Implementation of MCTS algorithms in Munos (2014)☆13Aug 8, 2018Updated 7 years ago
- A tool for experimenting with evolutionary optimization methods for machine learning algorithms, by distributing the workload over a larg…☆14Dec 19, 2018Updated 7 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆119Jul 25, 2024Updated last year
- Lossless compression using Probabilistic Circuits☆16Mar 10, 2022Updated 3 years ago
- An extension for VS Code which provides support for the Nim language.☆13Sep 24, 2020Updated 5 years ago
- A squad movement planning library for StarCraft AI using Monte Carlo Tree Search and Negamax☆14Jan 1, 2019Updated 7 years ago
- This is the source code for solving the Traveling Salesman Problems (TSP) using Monte Carlo tree search (MCTS).☆34Sep 25, 2019Updated 6 years ago
- (CoRL 2019 Spotlight) Asynchronous Methods for Model-Based Reinforcement Learning☆14Dec 27, 2022Updated 3 years ago
- Counterfactual Regret Minimization (CFR) sample code in Python☆14Apr 16, 2019Updated 6 years ago
- A fast sampling and analysis tool for biomolecules☆16Jan 20, 2025Updated last year
- ☆105Sep 25, 2019Updated 6 years ago
- ☆15Nov 22, 2019Updated 6 years ago
- Code for Unbiased Implicit Variational Inference (UIVI)☆15Jan 18, 2019Updated 7 years ago
- Potential-Aware Imperfect-Recall Abstraction with Earth Mover’s Distance in Imperfect-Information Games☆16Nov 29, 2025Updated 2 months ago
- A framework for easy prototyping of distributed reinforcement learning algorithms☆96Dec 8, 2020Updated 5 years ago
- ☆19Jul 18, 2021Updated 4 years ago