基于DQN的五子棋人机对弈
☆62Mar 24, 2019Updated 7 years ago
Alternatives and similar repositories for -Reinforcement-Learning-five-in-a-row-
Users that are interested in -Reinforcement-Learning-five-in-a-row- are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A DeepQLearging AI playing Gomoku☆10Mar 3, 2019Updated 7 years ago
- 用Python语言和Tkinter图形库实现的一个简单的五子棋程序☆18Jun 7, 2016Updated 10 years ago
- ☆23Dec 23, 2017Updated 8 years ago
- ♟♟♟♟♟ A Gomoku game AI based on Monte Carlo Tree Search, can be trained on policy-value network now. 一个蒙特卡洛树搜索算法实现的五子棋 AI,现可用神经网络训练模型。☆51Apr 10, 2020Updated 6 years ago
- 引用整理https://blog.csdn.net/yellow_red_people/article/details/80465510 一文中PyTorch平台,利用DQN模型玩Flappy Bird游戏,是一个再励学习(强化学习)实验例子。☆53Feb 10, 2019Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Extension of OpenAI Gym that implements multiple two-player zero-sum 2-dimension board games☆11Sep 11, 2022Updated 3 years ago
- This project solves self-made maze in a variety of ways: A-star, Q-learning and Deep Q-network.☆28Apr 1, 2017Updated 9 years ago
- ☆18Mar 23, 2023Updated 3 years ago
- A Gobang(also known as "Five in a Row" and "Gomoku") game equipped with AlphaGo-liked AI.☆14May 1, 2020Updated 6 years ago
- ☆30May 24, 2025Updated last year
- Automate hyper-parameters tuning for NNs (learning rate, number of dense layers and nodes and activation function)☆14Aug 9, 2020Updated 5 years ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 3 years ago
- Public Repo for the paper "Overcoming The Spectral-Bias of Neural Value Approximation"☆11May 25, 2024Updated 2 years ago
- ☆12Aug 28, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- OpenAI Gym Env for game Gomoku(Five-In-a-Row, 五子棋, 五目並べ, omok, Gobang,...)☆89Oct 11, 2024Updated last year
- An open source video conferencing tool for the XO laptop☆16Sep 20, 2013Updated 12 years ago
- ROS2 package that allows recording without interprocess communication☆19Apr 16, 2026Updated 2 months ago
- ☆12Feb 10, 2026Updated 4 months ago
- Tools for geospatial analysis of radar rainfall fields☆12Nov 30, 2016Updated 9 years ago
- 考研数据结构练习;目前在使用C更细致的重写:https://github.com/by777/dataStructureForC☆10Sep 26, 2018Updated 7 years ago
- Code for the KDD 2022 paper "Interpreting Trajectories from Multiple Views: A Hierarchical Self-Attention Network for Estimating the Time…☆18May 29, 2022Updated 4 years ago
- Input files and results of paper: Phase equilibrium of liquid water and hexagonal from ice enhanced sampling molecular dynamics simulatio…☆10Apr 9, 2021Updated 5 years ago
- ☆12Mar 13, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Final Thesis at Fudan University, built a trading strategy on Bitcoin market using recurrent reinforcement learning☆27Nov 5, 2018Updated 7 years ago
- 使用ROS2+RL 的循迹小车☆15Aug 30, 2024Updated last year
- Statistical methods for estimating scaling laws in urban data☆11Dec 9, 2024Updated last year
- Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow - Tensorlfow Im…☆13Feb 2, 2019Updated 7 years ago
- Matlab version of all the code of Lorena A. Barba's 12 steps to Navier stokes☆13Jul 17, 2019Updated 6 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆16May 19, 2023Updated 3 years ago
- ☆11Dec 26, 2017Updated 8 years ago
- ☆70Updated this week
- Codes for understanding Reinforcement Learning( updating... )☆24Jan 2, 2019Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 基于强化学习的五子棋☆12Dec 30, 2018Updated 7 years ago
- Multi-Candidate Speculative Decoding☆41Apr 22, 2024Updated 2 years ago
- A Finite Element Approximation of a Cahn--Hilliard Tumour Model with FEniCS, by Dennis Trautwein (2020).☆10Oct 11, 2020Updated 5 years ago
- alphaGo版本的五子棋(gobang, gomoku)☆68Mar 17, 2020Updated 6 years ago
- ☆14May 31, 2022Updated 4 years ago
- ☆14Oct 11, 2022Updated 3 years ago
- PyTorch Implementation of Ape-X (Distributed prioritized experience replay) architecture with DQN learner☆28Sep 5, 2020Updated 5 years ago