Chapter 15 AlphaZero in book Deep Reinforcement Learning: code example of AlphaZero solving Gomoku game.
☆36Feb 18, 2020Updated 6 years ago
Alternatives and similar repositories for Chapter15-AlphaZero
Users that are interested in Chapter15-AlphaZero are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Chapter 13 Learning to Run in book Deep Reinforcement Learning: code example of solving NIPS 2017: Learning to Run challenge with paralle…☆13Jul 4, 2021Updated 5 years ago
- Made for a reading group at the Center for Safe AGI.☆12Feb 23, 2026Updated 4 months ago
- A Python 3 Bandit Visualization Package☆11Oct 16, 2017Updated 8 years ago
- Click Me -->☆32Mar 3, 2023Updated 3 years ago
- Implementation of the AlphaZero algorithm for playing the simple board game Gomoku☆14May 22, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Connect6 AI based on reinforcement learning☆12Sep 13, 2019Updated 6 years ago
- ☆61Jan 12, 2019Updated 7 years ago
- An illustration program which visualizes the MCTS mechanism inside AlphaZero in order to provide a better understanding of how an AI make…☆19Aug 6, 2018Updated 7 years ago
- Modified versions of the Soft Actor-Critic algorithm for Atari games from https://github.com/ac-93/soft-actor-critic.☆20May 18, 2020Updated 6 years ago
- Low-Order modelling of Floating offshore wind Turbines/Farms for grid integration research☆20Aug 9, 2025Updated 10 months ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated last year
- Jean Gallier‘s Algebra, Topology, Differential Calculus, and Optimization Theory for Computer Science and Machine Learning Chinese versio…☆12Apr 16, 2020Updated 6 years ago
- ☆13Apr 29, 2023Updated 3 years ago
- C311 Spring 2022☆13Mar 17, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Basic template for using Flan-t5 on Banana's serverless GPU platform. Ready for 1-Click deploy☆11Jan 30, 2023Updated 3 years ago
- Understanding Deep Learning☆11Jul 23, 2024Updated last year
- ADP☆13Apr 12, 2017Updated 9 years ago
- Benchmark Generator for Global Routing☆13Jul 18, 2019Updated 6 years ago
- Dynamic ensemble learning based on RL and multi-objective optimization. Deep reinforcement learning and NSGA2 are combined to realize dy…☆32Jul 28, 2023Updated 2 years ago
- MiniGPT-Pancreas: Multimodal Large language Model for Pancreas Cancer Classification and Detection☆13Sep 19, 2025Updated 9 months ago
- ☆11Sep 27, 2022Updated 3 years ago
- ☆17Jun 9, 2026Updated 3 weeks ago
- ☆19Jun 10, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Common support code for user-facing front end systems.☆12Jun 23, 2026Updated last week
- A gomoku game engine based on MCTS method combined with DNN. Using C++ and Python.☆15Nov 21, 2018Updated 7 years ago
- A learning-based scheme to capture external force/torque caused by payload of tethered-UAV system☆21May 27, 2025Updated last year
- ☆10Dec 9, 2021Updated 4 years ago
- ☆14Nov 1, 2016Updated 9 years ago
- Using multiple sensor modalities to improve exploration for robotic manipulation tasks with sparse rewards☆10Sep 17, 2019Updated 6 years ago
- Paper: “MEMRL: SELF-EVOLVING AGENTS VIA RUNTIME REINFORCEMENT LEARNING ON EPISODIC MEMORY” Open-Source Code☆135May 2, 2026Updated 2 months ago
- ☆38May 2, 2019Updated 7 years ago
- Heuristic Dynamic Programming with Python☆14Jul 28, 2014Updated 11 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆15Mar 26, 2024Updated 2 years ago
- papers about reinforcement learning☆13Jan 4, 2021Updated 5 years ago
- DQN with freezing target network in tensorflow on pygame FlappyBird☆11Dec 19, 2018Updated 7 years ago
- ☆10Mar 24, 2023Updated 3 years ago
- Spatial Transformer Nets in TensorFlow/ TensorLayer☆36Jun 17, 2019Updated 7 years ago
- A method adapted from the paper Nonlinear System Identification of Soft Robot Dynamics Using Koopman Operator Theory by D. Bruder et al t…☆12Sep 24, 2020Updated 5 years ago
- This is the notebooks for videos in my Bilibili Channel (https://space.bilibili.com/32773300?spm_id_from=333.1007.0.0)☆35Nov 6, 2025Updated 7 months ago