Chapter 15, AlphaZero, of the book Deep Reinforcement Learning: a code example of AlphaZero solving the game of Gomoku.
☆36 · Feb 18, 2020 · Updated 6 years ago
Alternatives and similar repositories for Chapter15-AlphaZero
Users that are interested in Chapter15-AlphaZero are comparing it to the libraries listed below
- Chapter 13, Learning to Run, of the book Deep Reinforcement Learning: a code example for solving the NIPS 2017: Learning to Run challenge with paralle… ☆13 · Jul 4, 2021 · Updated 4 years ago
- Made for a reading group at the Center for Safe AGI. ☆12 · Feb 23, 2026 · Updated last week
- A Python 3 Bandit Visualization Package ☆11 · Oct 16, 2017 · Updated 8 years ago
- AirSim-based multi-UAV predictive maintenance application using reinforcement learning ☆24 · Jun 6, 2021 · Updated 4 years ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features" ☆17 · Mar 31, 2025 · Updated 11 months ago
- ☆17 · Jan 25, 2021 · Updated 5 years ago
- Connect6 AI based on reinforcement learning ☆12 · Sep 13, 2019 · Updated 6 years ago
- An illustration program which visualizes the MCTS mechanism inside AlphaZero in order to provide a better understanding of how an AI make… ☆19 · Aug 6, 2018 · Updated 7 years ago
- Dataset containing source code and deployed bytecode for Solidity Smart Contracts that have been verified on Etherscan.io, along with a c… ☆25 · Jun 13, 2022 · Updated 3 years ago
- Modified versions of the Soft Actor-Critic algorithm for Atari games from https://github.com/ac-93/soft-actor-critic. ☆20 · May 18, 2020 · Updated 5 years ago
- (Keras) Use deep Q-learning to build two Gomoku (Five-in-a-Row) agents playing against each other. ☆19 · Oct 8, 2016 · Updated 9 years ago
- Low-Order modelling of Floating offshore wind Turbines/Farms for grid integration research ☆18 · Aug 9, 2025 · Updated 6 months ago
- Rubik ESP32 esp-idf Device driver library. ☆12 · Jul 3, 2021 · Updated 4 years ago
- Verilog code for a low power RFID chip that will communicate with I2C sensors. ☆13 · Apr 18, 2014 · Updated 11 years ago
- A Python script to calculate radar cross section. ☆11 · Dec 26, 2023 · Updated 2 years ago
- ☆28 · Jun 24, 2019 · Updated 6 years ago
- Click Me --> ☆32 · Mar 3, 2023 · Updated 3 years ago
- Spatial Transformer Nets in TensorFlow/TensorLayer ☆36 · Jun 17, 2019 · Updated 6 years ago
- ☆10 · Dec 19, 2019 · Updated 6 years ago
- LC6500DMD Python control ☆11 · Nov 15, 2016 · Updated 9 years ago
- ADP ☆12 · Apr 12, 2017 · Updated 8 years ago
- ☆12 · Jan 12, 2019 · Updated 7 years ago
- High-resolution time-to-digital converter in the Red Pitaya Zynq-7010 SoC ☆10 · Jul 12, 2020 · Updated 5 years ago
- A Python implementation of PROCLUS: PROjected CLUStering algorithm. ☆10 · Jan 12, 2015 · Updated 11 years ago
- Code release for "Imagination Mechanism: Mesh Information Propagation for Enhancing Data Efficiency in Reinforcement Learning" ☆13 · Oct 7, 2023 · Updated 2 years ago
- ☆14 · Mar 21, 2024 · Updated last year
- Convert PaddlePaddle code to TensorLayerX code ☆10 · Feb 10, 2023 · Updated 3 years ago
- EBAZ4205 Board FPGA project ☆14 · Oct 20, 2023 · Updated 2 years ago
- Risk-sensitive Inverse Reinforcement Learning ☆11 · Sep 11, 2019 · Updated 6 years ago
- SunFounder FPV Omni Car for Arduino ☆14 · Feb 28, 2025 · Updated last year
- ☆10 · Mar 15, 2022 · Updated 3 years ago
- LLM Skirmish ☆44 · Feb 3, 2026 · Updated last month
- F4CK V2PH is a script for scraping images from the V2PH website. It supports album scraping, image URL extraction, and an image downloade… ☆13 · Sep 13, 2024 · Updated last year
- A collection of heat engines, based on the OpenAI Gym environment framework for use with reinforcement learning applications. ☆15 · Dec 20, 2021 · Updated 4 years ago
- ☆14 · Aug 19, 2025 · Updated 6 months ago
- ☆27 · Jan 9, 2026 · Updated last month
- The Bitmark Device ☆10 · Oct 13, 2015 · Updated 10 years ago
- A Unity project connecting to a local Pozyx MQTT positioning stream. ☆10 · Sep 13, 2019 · Updated 6 years ago
- ☆16 · May 31, 2024 · Updated last year