heinrichjh / nfsp-leduc
Neural Fictitious Self-Play in Leduc Holdem
☆11 · Jul 4, 2018 · Updated 7 years ago
Alternatives and similar repositories for nfsp-leduc
Users that are interested in nfsp-leduc are comparing it to the libraries listed below
- Reinforcement learning algorithms to play Poker ☆14 · Dec 29, 2021 · Updated 4 years ago
- Potential-Aware Imperfect-Recall Abstraction with Earth Mover’s Distance in Imperfect-Information Games ☆16 · Nov 29, 2025 · Updated 2 months ago
- Go engine with no human-provided knowledge, modeled after the AlphaGo Zero paper. ☆11 · Jan 17, 2020 · Updated 6 years ago
- Simple Redux like state management library based on RxJs. ☆11 · Dec 14, 2017 · Updated 8 years ago
- UNO card game with PyGame GUI ☆11 · Sep 29, 2021 · Updated 4 years ago
- Tabula Rasa Tic-Tac-Toe ☆10 · Jan 3, 2019 · Updated 7 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games ☆40 · Aug 27, 2021 · Updated 4 years ago
- ☆11 · Mar 20, 2022 · Updated 3 years ago
- Probabilistic Streaming Tensor Decomposition @ ICDM'2018 ☆12 · Apr 22, 2019 · Updated 6 years ago
- Prioritized Sequence Experience Replay ☆10 · Aug 16, 2021 · Updated 4 years ago
- Pytorch Implementation of the Distributed SAC. Test environment is LunarLanderContinuous-v2 and Metaworld MT1, MT10 ☆12 · Apr 6, 2022 · Updated 3 years ago
- Talent builder for Rise of Kingdoms ☆13 · May 18, 2025 · Updated 8 months ago
- ☆10 · Feb 28, 2019 · Updated 6 years ago
- A web app for sharing, editing, and commenting on kifus (game records for the board game Go) ☆10 · Jan 22, 2019 · Updated 7 years ago
- This is MPE-pytorch, fix some bugs. ☆10 · Apr 26, 2020 · Updated 5 years ago
- This is the sample files created for Excel Financial Modeling ☆11 · Nov 2, 2022 · Updated 3 years ago
- ☆12 · Oct 25, 2020 · Updated 5 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data. ☆11 · Nov 3, 2020 · Updated 5 years ago
- Simple C++ project that includes header only implementations of Monte Carlo Tree Search(MCTS), Temporal Difference Learning, Minimax, an… ☆11 · Jan 29, 2026 · Updated 2 weeks ago
- ☆10 · Nov 27, 2019 · Updated 6 years ago
- Tensor Belief Propagation - algorithm for approximate inference in discrete graphical models ☆12 · Feb 17, 2020 · Updated 6 years ago
- FailureSensorIQ, a dataset and benchmark to probe LLMs’ reasoning and comprehension of sensor–failure relationships in industrial systems… ☆33 · Feb 5, 2026 · Updated last week
- PyTorch implementation of "Variational Autoencoders with Jointly Optimized Latent Dependency Structure" [ICLR 2019] ☆13 · Jul 14, 2019 · Updated 6 years ago
- Adaptable Agent Populations via a Generative Model of Policies ☆12 · Oct 14, 2021 · Updated 4 years ago
- Recursive Bayesian Networks ☆11 · May 11, 2025 · Updated 9 months ago
- A repository around the Annual Computer Poker Competition server, for https://github.com/deepmind/open_spiel ☆13 · Apr 13, 2021 · Updated 4 years ago
- Comp 781 Project ☆10 · Jan 2, 2026 · Updated last month
- Repository for Iterated Relearning: The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning ☆11 · Jun 8, 2020 · Updated 5 years ago
- Explore and Control with Adversarial Surprise ☆10 · Jul 20, 2021 · Updated 4 years ago
- A demo to show how to convert a TensorFlow model to TensorRT uff or PLAN ☆11 · Jul 22, 2018 · Updated 7 years ago
- ☆10 · Jun 22, 2020 · Updated 5 years ago
- LTL2PDDL tool ☆11 · Jul 7, 2017 · Updated 8 years ago
- Python program to convert a Context Free Grammar to Chomsky Normal Form. ☆10 · May 9, 2025 · Updated 9 months ago
- Testing different RL algorithms for multi-agent environments. From SARSA, QLearning to Independent Q-Learning, Joint Action Learning and … ☆12 · Mar 29, 2019 · Updated 6 years ago
- Solution to Kaggle's Google Research Football Competition ☆14 · Dec 2, 2020 · Updated 5 years ago
- Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf) ☆15 · Jan 19, 2021 · Updated 5 years ago
- LLM-enabled Robot Swarms ☆17 · May 20, 2025 · Updated 8 months ago
- My Homepage ☆10 · Feb 5, 2026 · Updated last week
- Aquaplanning QUick Automated Planning. ☆13 · Oct 13, 2020 · Updated 5 years ago