zhijs / -Reinforcement-Learning-five-in-a-row-View external linksLinks
基于DQN的五子棋人机对弈
☆62Mar 24, 2019Updated 6 years ago
Alternatives and similar repositories for -Reinforcement-Learning-five-in-a-row-
Users that are interested in -Reinforcement-Learning-five-in-a-row- are comparing it to the libraries listed below
Sorting:
- A DeepQLearging AI playing Gomoku☆10Mar 3, 2019Updated 6 years ago
- 通过python3.6编程,利用DQN算法实现机器学习避开障碍走到迷宫终点。(Through python3.6 programming, I use DQN algorithm to achieve machine learning and avoid obstacles…☆10Apr 15, 2018Updated 7 years ago
- 尝试了博弈树Min-Max + alpha-Beta剪枝方法,并找到了更好的适用于五子棋智能的棋局评估模型和选择模型☆54May 10, 2018Updated 7 years ago
- ☆18Oct 4, 2024Updated last year
- 用深度学习+强化学习编写的一个五子棋人工智障☆45Feb 16, 2018Updated 8 years ago
- 基于博弈树α-β剪枝搜索的五子棋AI☆779Jul 14, 2017Updated 8 years ago
- 用Python语言和Tkinter图形库实现的一个简单的五子棋程序☆18Jun 7, 2016Updated 9 years ago
- Codes for understanding Reinforcement Learning( updating... )☆24Jan 2, 2019Updated 7 years ago
- Final Thesis at Fudan University, built a trading strategy on Bitcoin market using recurrent reinforcement learning☆27Nov 5, 2018Updated 7 years ago
- PyTorch Implementation of Ape-X (Distributed prioritized experience replay) architecture with DQN learner☆28Sep 5, 2020Updated 5 years ago
- A repository for a Deep Q-Learning approach to intrusion detection for networks cyber-attacks.☆10Sep 3, 2021Updated 4 years ago
- This project features a dynamic combat and traversal system inspired by Sekiro, incorporating fluid movement, precise timing, and strateg…☆13Oct 22, 2024Updated last year
- a Federated Learning Framework adapted for resource-constrained environments, focusing on IoT devices☆10Oct 6, 2025Updated 4 months ago
- Implementation of Dynamic Computation Offloading Control Logic in a Software-Defined Vehicle (SDV) System☆11Dec 19, 2024Updated last year
- ☆10Dec 10, 2021Updated 4 years ago
- A Read-time MIDI visualization tool using PyQt☆10Nov 24, 2020Updated 5 years ago
- Tensorflow DQN and DRQN agent playing doom☆35May 5, 2017Updated 8 years ago
- This project solves self-made maze in a variety of ways: A-star, Q-learning and Deep Q-network.☆28Apr 1, 2017Updated 8 years ago
- The DomUI Java User interface library☆13Updated this week
- [ICML 2024 Oral] Consistent Adversarial Robust Deep Q Networks (CAR-DQN)☆15Feb 27, 2025Updated 11 months ago
- Capture Star Citizen's meta data packets.☆11Oct 9, 2018Updated 7 years ago
- Deep Reinforcement Learning based Autonomous Driving Agents☆10Jul 7, 2022Updated 3 years ago
- Protect workers with TensorFlow Hard Hat object detection model on a Jetson Nano☆10Sep 27, 2022Updated 3 years ago
- 基于多个图片API的搜索服务和图标生成功能,专门设计用于与 Cursor MCP 服务集成。支持图片搜索、下载和AI生成图标。☆13May 8, 2025Updated 9 months ago
- Dice Scores Recognition in images and live video using CNN.☆13Dec 19, 2020Updated 5 years ago
- Vue + WebSocket + SpringBoot + MongoDB + Mysql + github图床完成的聊天系统,支持头像更改,私聊,聊天室,聊天记录存储☆11Jan 27, 2023Updated 3 years ago
- Source code for "Congestion-aware Distributed Task Offloading in Wireless Multi-hop Networks Using Graph Neural Networks"☆14Oct 23, 2024Updated last year
- This is the BFBAN community site, which contains both the back end and the front end. It is currently up and running. You can visit bfban…☆11Jan 20, 2026Updated 3 weeks ago
- ☆10Jul 26, 2024Updated last year
- Teaching the Donkey car to drive a track in the simulator using State Representation Learning and different Reinforcement Learning Algori…☆12Dec 6, 2021Updated 4 years ago
- My undergraduate final project - Modeling and control of a distillation column using neural networks and reinforcement learning.☆12Apr 28, 2020Updated 5 years ago
- Optimistic Bull or Pessimistic Bear: Adaptive Deep Reinforcement Learning for Stock Portfolio Allocation☆38Jun 11, 2019Updated 6 years ago
- A Finite Element Approximation of a Cahn--Hilliard Tumour Model with FEniCS, by Dennis Trautwein (2020).☆10Oct 11, 2020Updated 5 years ago
- ☆11Jan 11, 2022Updated 4 years ago
- An HTTP client for the Rust AWS SDK that runs on Fastly Compute @ Edge☆10Nov 11, 2025Updated 3 months ago
- This operator will manage and configure data processing unit (DPUs) to be used in accelerating/offloading k8s networking functions☆12Jan 10, 2026Updated last month
- Generation of columnar jointed rock using Voronoi method☆10Dec 13, 2019Updated 6 years ago
- Reinforcement learning project using deep Q-learning to control the operations of an electrical microgrid☆10Jan 3, 2023Updated 3 years ago
- This is a DQN-based recommendation system for item-list recommendation and it finally achieved second place in the competition of RL-base…☆11Oct 8, 2021Updated 4 years ago