基于DQN的五子棋人机对弈
☆61Mar 24, 2019Updated 7 years ago
Alternatives and similar repositories for -Reinforcement-Learning-five-in-a-row-
Users that are interested in -Reinforcement-Learning-five-in-a-row- are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 通过python3.6编程,利用DQN算法实现机器学习避开障碍走到迷宫终点。(Through python3.6 programming, I use DQN algorithm to achieve machine learning and avoid obstacles…☆10Apr 15, 2018Updated 8 years ago
- 尝试了博弈树Min-Max + alpha-Beta剪枝方法,并找到了更好的适用于五子棋智能的棋局评估模型和选择模型☆54May 10, 2018Updated 8 years ago
- ☆16Aug 19, 2024Updated last year
- ☆23Dec 23, 2017Updated 8 years ago
- ☆22May 3, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 引用整理https://blog.csdn.net/yellow_red_people/article/details/80465510 一文中PyTorch平台,利用DQN模型玩Flappy Bird游戏,是一个再励学习(强化学习)实验例子。☆53Feb 10, 2019Updated 7 years ago
- Extension of OpenAI Gym that implements multiple two-player zero-sum 2-dimension board games☆11Sep 11, 2022Updated 3 years ago
- ☆18Oct 4, 2024Updated last year
- Code release for "TempLM: Distilling Language Models into Template-Based Generators"☆14Jul 21, 2022Updated 3 years ago
- ☆18Mar 23, 2023Updated 3 years ago
- ☆30May 24, 2025Updated last year
- Automate hyper-parameters tuning for NNs (learning rate, number of dense layers and nodes and activation function)☆14Aug 9, 2020Updated 5 years ago
- Public Repo for the paper "Overcoming The Spectral-Bias of Neural Value Approximation"☆11May 25, 2024Updated 2 years ago
- Ultralightweight JSON parser in ANSI C☆10Mar 8, 2017Updated 9 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- OpenAI Gym Env for game Gomoku(Five-In-a-Row, 五子棋, 五目並べ, omok, Gobang,...)☆89Oct 11, 2024Updated last year
- ROS2 package that allows recording without interprocess communication☆19Apr 16, 2026Updated last month
- 贴吧舆情监测及干预工具☆13May 10, 2017Updated 9 years ago
- Dice Scores Recognition in images and live video using CNN.☆13Dec 19, 2020Updated 5 years ago
- Code used for the master thesis at MIIS (UPF)☆16Dec 1, 2016Updated 9 years ago
- Code and data for "Inferring Rewards from Language in Context" [ACL 2022].☆16May 22, 2022Updated 4 years ago
- Input files and results of paper: Phase equilibrium of liquid water and hexagonal from ice enhanced sampling molecular dynamics simulatio…☆10Apr 9, 2021Updated 5 years ago
- Demonstration of a factory pattern where the types automatically register themselves☆13Mar 13, 2019Updated 7 years ago
- ☆12Mar 13, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Turn any camera (Insta360, RealSense, USB webcam, etc.) into ROS2 image topics. Unified config for VLA deployment and SFT data collection…☆43Feb 4, 2026Updated 3 months ago
- 基于Deep Qlearning Network的股票交易模型☆57May 15, 2017Updated 9 years ago
- MaxSum is an algorithm about Distributed Constraint Optimization Problems (DCOPs)☆11Jan 15, 2018Updated 8 years ago
- Generation of columnar jointed rock using Voronoi method☆10Dec 13, 2019Updated 6 years ago
- a Renju game, replicate paper "Mastering the game of Go with deep neural networks and tree search"☆20Jun 29, 2016Updated 9 years ago
- Matlab version of all the code of Lorena A. Barba's 12 steps to Navier stokes☆13Jul 17, 2019Updated 6 years ago
- Computer Vision Research Project☆11Aug 30, 2019Updated 6 years ago
- Geog 2021 Environmental Remote Sensing☆16Jan 4, 2019Updated 7 years ago
- Single Episode Policy Transfer in Reinforcement Learning☆17Jun 13, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆16May 19, 2023Updated 3 years ago
- ☆57Apr 30, 2026Updated 3 weeks ago
- Codes for understanding Reinforcement Learning( updating... )☆24Jan 2, 2019Updated 7 years ago
- 基于强化学习的五子棋☆12Dec 30, 2018Updated 7 years ago
- Multi-Candidate Speculative Decoding☆40Apr 22, 2024Updated 2 years ago
- A Finite Element Approximation of a Cahn--Hilliard Tumour Model with FEniCS, by Dennis Trautwein (2020).☆10Oct 11, 2020Updated 5 years ago
- ☆14May 31, 2022Updated 3 years ago