Chapter 15 (AlphaZero) of the book Deep Reinforcement Learning: a code example of AlphaZero playing the game Gomoku.
☆36 · Feb 18, 2020 · Updated 6 years ago
Alternatives and similar repositories for Chapter15-AlphaZero
Users who are interested in Chapter15-AlphaZero are comparing it to the libraries listed below.
- Chapter 13 Learning to Run in book Deep Reinforcement Learning: code example of solving NIPS 2017: Learning to Run challenge with paralle… ☆13 · Jul 4, 2021 · Updated 4 years ago
- A Python 3 Bandit Visualization Package ☆11 · Oct 16, 2017 · Updated 8 years ago
- ☆61 · Jan 12, 2019 · Updated 7 years ago
- Implementation of the AlphaZero algorithm for playing the simple board game Gomoku ☆14 · May 22, 2023 · Updated 2 years ago
- Connect6 AI based on reinforcement learning ☆12 · Sep 13, 2019 · Updated 6 years ago
- Modified versions of the Soft Actor-Critic algorithm for Atari games from https://github.com/ac-93/soft-actor-critic. ☆20 · May 18, 2020 · Updated 5 years ago
- SCoRe: Training Language Models to Self-Correct via Reinforcement Learning ☆16 · Jan 24, 2025 · Updated last year
- Low-Order modelling of Floating offshore wind Turbines/Farms for grid integration research ☆19 · Aug 9, 2025 · Updated 7 months ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features" ☆17 · Mar 31, 2025 · Updated 11 months ago
- ☆11 · Sep 6, 2024 · Updated last year
- Implementation of Stein Variational Gradient Descent with TensorFlow 2.0 ☆12 · Sep 11, 2019 · Updated 6 years ago
- Implementation of Compressed SGD with Compressed Gradients in Pytorch ☆13 · Jul 25, 2024 · Updated last year
- On Lipschitz Regularization of Convolutional Layers using Toeplitz Matrix Theory ☆10 · Aug 19, 2021 · Updated 4 years ago
- Jean Gallier’s Algebra, Topology, Differential Calculus, and Optimization Theory for Computer Science and Machine Learning Chinese versio… ☆11 · Apr 16, 2020 · Updated 5 years ago
- C311 Spring 2022 ☆13 · Mar 17, 2025 · Updated last year
- Basic template for using Flan-t5 on Banana's serverless GPU platform. Ready for 1-Click deploy ☆11 · Jan 30, 2023 · Updated 3 years ago
- Understanding Deep Learning ☆11 · Jul 23, 2024 · Updated last year
- DQN example code for Chapter 4 ☆44 · Mar 24, 2023 · Updated 3 years ago
- MiniGPT-Pancreas: Multimodal Large language Model for Pancreas Cancer Classification and Detection ☆11 · Sep 19, 2025 · Updated 6 months ago
- ☆11 · Sep 27, 2022 · Updated 3 years ago
- Transfer PaddlePaddle's codes to TensorLayerX's codes ☆10 · Feb 10, 2023 · Updated 3 years ago
- Common support code for user-facing front end systems. ☆12 · Updated this week
- ☆55 · Aug 30, 2023 · Updated 2 years ago
- A learning-based scheme to capture external force/torque caused by payload of tethered-UAV system ☆19 · May 27, 2025 · Updated 9 months ago
- A collection of free online materials for control engineering ☆19 · Feb 4, 2025 · Updated last year
- This repository contains the code for implementing the algorithms in the paper "Semantics-Guided Diffusion for Deep Joint Source-Channel … ☆38 · Apr 1, 2025 · Updated 11 months ago
- ☆10 · Dec 9, 2021 · Updated 4 years ago
- Deep RL agents with PyTorch ☆36 · Sep 25, 2021 · Updated 4 years ago
- Using multiple sensor modalities to improve exploration for robotic manipulation tasks with sparse rewards ☆10 · Sep 17, 2019 · Updated 6 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm. ☆10 · Jan 12, 2015 · Updated 11 years ago
- Recommendation engine and its algorithms in Python and R. ☆12 · Oct 26, 2018 · Updated 7 years ago
- South China Normal University Beamer template ☆15 · Nov 11, 2020 · Updated 5 years ago
- ☆15 · Mar 26, 2024 · Updated last year
- PyTorch implementation of the paper "Beamforming Design for Large-Scale Antenna Arrays Using Deep Learning" ☆14 · Jun 1, 2020 · Updated 5 years ago
- ArXiv'18 implementation of amortized maximum likelihood (AML) for high-quality, weakly-supervised shape completion. ☆11 · Nov 30, 2018 · Updated 7 years ago
- A simple and efficient llama3 local service deployment solution that supports real-time streaming response and is optimized for common Ch… ☆13 · Jul 31, 2024 · Updated last year
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large… ☆15 · Jun 4, 2025 · Updated 9 months ago
- DQN with freezing target network in tensorflow on pygame FlappyBird ☆11 · Dec 19, 2018 · Updated 7 years ago
- ☆10 · Mar 24, 2023 · Updated 3 years ago