Chapter 15 AlphaZero in book Deep Reinforcement Learning: code example of AlphaZero solving Gomoku game.
☆36Feb 18, 2020Updated 6 years ago
Alternatives and similar repositories for Chapter15-AlphaZero
Users that are interested in Chapter15-AlphaZero are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Made for a reading group at the Center for Safe AGI.☆12Feb 23, 2026Updated 3 months ago
- (Keras) Use deep Q-learning to build two Gomoku (Five-in-a-Row) agents playing against each other.☆19Oct 8, 2016Updated 9 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆221Feb 28, 2025Updated last year
- AirSim based multi uav predictive manteinance application using reinforcement learning☆25Jun 6, 2021Updated 4 years ago
- A Python 3 Bandit Visualization Package☆11Oct 16, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Implementation of the AlphaZero algorithm for playing the simple board game Gomoku☆14May 22, 2023Updated 3 years ago
- Connect6 AI based on reinforcement learning☆12Sep 13, 2019Updated 6 years ago
- Modified versions of the Soft Actor-Critic algorithm for Atari games from https://github.com/ac-93/soft-actor-critic.☆20May 18, 2020Updated 6 years ago
- Implementation of Stein Variational Gradient Descent with TensorFlow 2.0☆12Sep 11, 2019Updated 6 years ago
- ♟♟♟♟♟ A Gomoku game AI based on Monte Carlo Tree Search, can be trained on policy-value network now. 一个蒙特卡洛树搜索算法实现的五子棋 AI,现可用神经网络训练模型。☆52Apr 10, 2020Updated 6 years ago
- ADP☆13Apr 12, 2017Updated 9 years ago
- Dynamic ensemble learning based on RL and multi-objective optimization. Deep reinforcement learning and NSGA2 are combined to realize dy…☆32Jul 28, 2023Updated 2 years ago
- MiniGPT-Pancreas: Multimodal Large language Model for Pancreas Cancer Classification and Detection☆12Sep 19, 2025Updated 8 months ago
- ☆17May 31, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A learning-based scheme to capture external force/torque caused by payload of tethered-UAV system☆20May 27, 2025Updated 11 months ago
- A collection of free online materials for control engineering☆20Feb 4, 2025Updated last year
- Google MobileNets Implementation using Tensorflow☆18Jun 6, 2017Updated 8 years ago
- ☆10Dec 9, 2021Updated 4 years ago
- Deep RL agents with PyTorch☆36Sep 25, 2021Updated 4 years ago
- Using multiple sensor modalities to improve exploration for robotic manipulation tasks with sparse rewards☆10Sep 17, 2019Updated 6 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- Recommendation engine and it's algorithms in python , R .☆12Oct 26, 2018Updated 7 years ago
- ☆38May 2, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An implementation of improved AlphaGo algorithm in the game of Gomoku.☆58Nov 12, 2019Updated 6 years ago
- Heuristic Dynamic Programming with Python☆14Jul 28, 2014Updated 11 years ago
- ☆15Mar 26, 2024Updated 2 years ago
- A simple and efficient llama3 local service deployment solution that supports real-time streaming response and is optimized for common Ch…☆13Jul 31, 2024Updated last year
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…☆15Jun 4, 2025Updated 11 months ago
- ☆10Mar 24, 2023Updated 3 years ago
- Spatial Transformer Nets in TensorFlow/ TensorLayer☆36Jun 17, 2019Updated 6 years ago
- A method adapted from the paper Nonlinear System Identification of Soft Robot Dynamics Using Koopman Operator Theory by D. Bruder et al t…☆12Sep 24, 2020Updated 5 years ago
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games☆15Feb 8, 2026Updated 3 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆15Apr 14, 2025Updated last year
- OpenControl is a python package that implements basic algorithms for the analysis and design of optimal feedback controllers.☆15Jul 16, 2021Updated 4 years ago
- Hands-On TensorBoard for PyTorch Developers, Published by Packt☆11Dec 15, 2025Updated 5 months ago
- ☆28Jun 24, 2019Updated 6 years ago
- Model-based shared control of human-machine systems☆14Jul 26, 2018Updated 7 years ago
- [AAAI 2023 Oral] Official code for "PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction".☆21Jul 26, 2025Updated 9 months ago
- Mobile App Interface to interact with OpenAI (DALLE 2 and ChatGPT) open source tools☆13Jan 16, 2023Updated 3 years ago