Chapter 15, AlphaZero, of the book Deep Reinforcement Learning: a code example of AlphaZero solving the game of Gomoku.
☆36 · Feb 18, 2020 · Updated 6 years ago
Alternatives and similar repositories for Chapter15-AlphaZero
Users that are interested in Chapter15-AlphaZero are comparing it to the libraries listed below.
- Made for a reading group at the Center for Safe AGI. ☆12 · Feb 23, 2026 · Updated last month
- (Keras) Use deep Q-learning to build two Gomoku (Five-in-a-Row) agents playing against each other. ☆19 · Oct 8, 2016 · Updated 9 years ago
- An asynchronous/parallel method of the AlphaGo Zero algorithm with Gomoku ☆220 · Feb 28, 2025 · Updated last year
- AirSim-based multi-UAV predictive maintenance application using reinforcement learning ☆25 · Jun 6, 2021 · Updated 4 years ago
- ☆61 · Jan 12, 2019 · Updated 7 years ago
- Implementation of the AlphaZero algorithm for playing the simple board game Gomoku ☆14 · May 22, 2023 · Updated 2 years ago
- Connect6 AI based on reinforcement learning ☆12 · Sep 13, 2019 · Updated 6 years ago
- An illustration program which visualizes the MCTS mechanism inside AlphaZero in order to provide a better understanding of how an AI make… ☆19 · Aug 6, 2018 · Updated 7 years ago
- Low-Order modelling of Floating offshore wind Turbines/Farms for grid integration research ☆19 · Aug 9, 2025 · Updated 8 months ago
- SCoRe: Training Language Models to Self-Correct via Reinforcement Learning ☆16 · Jan 24, 2025 · Updated last year
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features" ☆17 · Mar 31, 2025 · Updated last year
- ♟♟♟♟♟ A Gomoku game AI based on Monte Carlo Tree Search; it can now be trained with a policy-value network. ☆52 · Apr 10, 2020 · Updated 6 years ago
- ☆11 · Sep 27, 2022 · Updated 3 years ago
- ☆17 · May 31, 2024 · Updated last year
- Transfer PaddlePaddle's codes to TensorLayerX's codes ☆10 · Feb 10, 2023 · Updated 3 years ago
- Common support code for user-facing front end systems. ☆12 · Updated this week
- A collection of free online materials for control engineering ☆19 · Feb 4, 2025 · Updated last year
- Google MobileNets Implementation using Tensorflow ☆18 · Jun 6, 2017 · Updated 8 years ago
- ☆10 · Dec 9, 2021 · Updated 4 years ago
- Using multiple sensor modalities to improve exploration for robotic manipulation tasks with sparse rewards ☆10 · Sep 17, 2019 · Updated 6 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm. ☆10 · Jan 12, 2015 · Updated 11 years ago
- ☆38 · May 2, 2019 · Updated 6 years ago
- ☆15 · Mar 26, 2024 · Updated 2 years ago
- Heuristic Dynamic Programming with Python ☆14 · Jul 28, 2014 · Updated 11 years ago
- ArXiv'18 implementation of amortized maximum likelihood (AML) for high-quality, weakly-supervised shape completion. ☆11 · Nov 30, 2018 · Updated 7 years ago
- A simple and efficient llama3 local service deployment solution that supports real-time streaming response and is optimized for common Ch… ☆13 · Jul 31, 2024 · Updated last year
- papers about reinforcement learning ☆13 · Jan 4, 2021 · Updated 5 years ago
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large… ☆15 · Jun 4, 2025 · Updated 10 months ago
- DQN with freezing target network in tensorflow on pygame FlappyBird ☆11 · Dec 19, 2018 · Updated 7 years ago
- ☆10 · Mar 24, 2023 · Updated 3 years ago
- A method adapted from the paper Nonlinear System Identification of Soft Robot Dynamics Using Koopman Operator Theory by D. Bruder et al t… ☆12 · Sep 24, 2020 · Updated 5 years ago
- ☆10 · Jun 21, 2021 · Updated 4 years ago
- Mac port of Torcs, The Open Racing Car Simulator ☆11 · Jun 16, 2010 · Updated 15 years ago
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games ☆15 · Feb 8, 2026 · Updated 2 months ago
- The figures for the Deep Learning textbook (www.deeplearningbook.org) ☆17 · Oct 9, 2017 · Updated 8 years ago
- OpenControl is a python package that implements basic algorithms for the analysis and design of optimal feedback controllers. ☆15 · Jul 16, 2021 · Updated 4 years ago
- Dataset containing source code and deployed bytecode for Solidity Smart Contracts that have been verified on Etherscan.io, along with a c… ☆25 · Jun 13, 2022 · Updated 3 years ago
- Model-based shared control of human-machine systems ☆14 · Jul 26, 2018 · Updated 7 years ago
- ☆10 · Mar 15, 2022 · Updated 4 years ago