DQN-based human-vs-computer Gomoku
☆61Mar 24, 2019Updated 7 years ago
Alternatives and similar repositories for -Reinforcement-Learning-five-in-a-row-
Users that are interested in -Reinforcement-Learning-five-in-a-row- are comparing it to the libraries listed below.
- Programmed in Python 3.6, using the DQN algorithm to train an agent that avoids obstacles and reaches the end of a maze.☆10Apr 15, 2018Updated 8 years ago
- A Gomoku "artificial idiot" written with deep learning + reinforcement learning☆45Feb 16, 2018Updated 8 years ago
- ☆23Dec 23, 2017Updated 8 years ago
- ♟♟♟♟♟ A Gomoku game AI based on Monte Carlo Tree Search; it can now be trained with a policy-value network.☆52Apr 10, 2020Updated 6 years ago
- Adapted from the article https://blog.csdn.net/yellow_red_people/article/details/80465510: uses a DQN model on PyTorch to play Flappy Bird, as an experimental example of reinforcement learning.☆53Feb 10, 2019Updated 7 years ago
- Extension of OpenAI Gym that implements multiple two-player, zero-sum, two-dimensional board games☆11Sep 11, 2022Updated 3 years ago
- ☆18Oct 4, 2024Updated last year
- ☆12Aug 7, 2020Updated 5 years ago
- This project solves a self-made maze in a variety of ways: A-star, Q-learning and Deep Q-network.☆28Apr 1, 2017Updated 9 years ago
- Official code repository for the paper "Rethinking Model Prototyping through the MedMNIST+ Dataset Collection" @ Scientific Reports☆13Mar 5, 2025Updated last year
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 3 years ago
- Public Repo for the paper "Overcoming The Spectral-Bias of Neural Value Approximation"☆11May 25, 2024Updated last year
- DQN stock trading pytorch implementation☆41May 11, 2019Updated 6 years ago
- ☆12Aug 28, 2020Updated 5 years ago
- Predicting 2D Steady State Fluid Flow Fields using Convolutional Neural Networks☆12Oct 3, 2020Updated 5 years ago
- Code used for the master thesis at MIIS (UPF)☆16Dec 1, 2016Updated 9 years ago
- Code and data for "Inferring Rewards from Language in Context" [ACL 2022].☆16May 22, 2022Updated 3 years ago
- Final Thesis at Fudan University, built a trading strategy on Bitcoin market using recurrent reinforcement learning☆27Nov 5, 2018Updated 7 years ago
- MaxSum is an algorithm for Distributed Constraint Optimization Problems (DCOPs)☆11Jan 15, 2018Updated 8 years ago
- A Renju game that replicates the paper "Mastering the game of Go with deep neural networks and tree search"☆20Jun 29, 2016Updated 9 years ago
- Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow - TensorFlow Im…☆13Feb 2, 2019Updated 7 years ago
- MATLAB version of all the code from Lorena A. Barba's 12 steps to Navier-Stokes☆13Jul 17, 2019Updated 6 years ago
- Computer Vision Research Project☆11Aug 30, 2019Updated 6 years ago
- Single Episode Policy Transfer in Reinforcement Learning☆17Jun 13, 2022Updated 3 years ago
- MXNet implementation of Deep Reinforcement Learning papers, such as DQN, PG, DDPG, PPO☆28Dec 8, 2022Updated 3 years ago
- Gomoku based on reinforcement learning☆12Dec 30, 2018Updated 7 years ago
- A Finite Element Approximation of a Cahn-Hilliard Tumour Model with FEniCS, by Dennis Trautwein (2020).☆10Oct 11, 2020Updated 5 years ago
- Dockerfile to build an image with Nginx and Node (npm and yarn) on Alpine Linux☆19Aug 23, 2018Updated 7 years ago
- 16-811 Project.☆10Jan 12, 2018Updated 8 years ago
- ☆14May 31, 2022Updated 3 years ago
- PyTorch Implementation of Ape-X (Distributed prioritized experience replay) architecture with DQN learner☆28Sep 5, 2020Updated 5 years ago
- A gobang robot based on reinforcement learning.☆165Mar 28, 2023Updated 3 years ago
- Hyperparameter Bayesian Optimization for Image Classification in PyTorch☆12Aug 20, 2019Updated 6 years ago
- Short text similarity matching model based on deep learning and machine learning☆15Jan 9, 2019Updated 7 years ago
- [Compiler Principles] Syntax analysis lab☆12May 29, 2019Updated 6 years ago
- Library for solving time-dependent PDEs in FEniCS using Runge-Kutta ESDIRK methods☆13Mar 30, 2026Updated last month
- Mirror of Sven Verdoolaege's isl at http://repo.or.cz/w/isl.git (occasionally with changes for islpy)☆10Dec 16, 2025Updated 4 months ago
- cat vs dog in caffe, tensorflow, pytorch, paddle☆13Mar 5, 2021Updated 5 years ago
- some strategies for exposure bias in seq2seq☆18Sep 9, 2020Updated 5 years ago