(Keras) Use deep Q-learning to build two Gomoku (Five-in-a-Row) agents playing against each other.
☆19Oct 8, 2016Updated 9 years ago
Alternatives and similar repositories for Reinforcement_Learning_Project
Users that are interested in Reinforcement_Learning_Project are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tensorflow implementation for "Noisy network for exploration"☆19Aug 2, 2017Updated 8 years ago
- An illustration program which visualizes the MCTS mechanism inside AlphaZero in order to provide a better understanding of how an AI make…☆19Aug 6, 2018Updated 7 years ago
- ☆12Oct 30, 2022Updated 3 years ago
- Some code for tutorials following https://gym.openai.com/docs/rl☆14Jul 3, 2016Updated 9 years ago
- Chapter 15 AlphaZero in book Deep Reinforcement Learning: code example of AlphaZero solving Gomoku game.☆36Feb 18, 2020Updated 6 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- paper <<Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation>> python implementation☆10Mar 27, 2018Updated 8 years ago
- Here are some Python implementations of Gomoku AIs, including MCTS, Minimax and Genetic Alg.☆33Dec 14, 2018Updated 7 years ago
- Interesting and colorful Alert style--iOS OC&Swift炫酷的可编辑弹窗(AlertController/Alert)☆12Apr 18, 2019Updated 6 years ago
- a Renju game, replicate paper "Mastering the game of Go with deep neural networks and tree search"☆20Jun 29, 2016Updated 9 years ago
- 3D learning environment with rigid body simulation for Linux/MacOSX☆14Dec 24, 2021Updated 4 years ago
- ☆10Dec 25, 2019Updated 6 years ago
- Hex board game with MCTS implementation☆12Aug 15, 2023Updated 2 years ago
- in progress☆60Feb 19, 2016Updated 10 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Dec 11, 2019Updated 6 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆134May 5, 2019Updated 6 years ago
- Interactive Multi-Agent Reinforcement Learning Environment for the board game Gobblet using PettingZoo.☆12Jul 2, 2023Updated 2 years ago
- Tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆13Dec 23, 2016Updated 9 years ago
- A short hands-on of CNN using Stanford CS231n online material☆17Oct 23, 2017Updated 8 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆57Aug 25, 2017Updated 8 years ago
- Python binding to the levmar library using Cython☆20Mar 18, 2020Updated 6 years ago
- A Swift Wrapper for PyTorch and Torchvision.☆14Jul 19, 2019Updated 6 years ago
- ORB-SLAM2-IMU-VIO 直接法加速的惯导加持的ORB-SLAM2☆11Nov 21, 2018Updated 7 years ago
- A Gobang(also known as "Five in a Row" and "Gomoku") game equipped with AlphaGo-liked AI.☆14May 1, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- General Python implementation of Monte Carlo Tree Search for the use with Open AI Gym environments.☆42Oct 8, 2020Updated 5 years ago
- ☆12Sep 23, 2020Updated 5 years ago
- Combine fMRI/EEG to learn about music/auditory processing☆16Dec 8, 2022Updated 3 years ago
- ☆13Jul 4, 2017Updated 8 years ago
- An implementation in python of some game agents such as AlphaBeta or MCTS, that can be applied to any n-player non deterministic game obj…☆12May 29, 2022Updated 3 years ago
- Yelp Restaurant Photo Classification - Kaggle competition☆12Apr 19, 2019Updated 6 years ago
- Reinforcement Learning Assembly☆92Sep 2, 2021Updated 4 years ago
- Non official torchnet package for vision☆20Feb 4, 2017Updated 9 years ago
- ☆10Nov 19, 2015Updated 10 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Finalist entry for the M2CAI Workflow Challenge 2016☆10Nov 25, 2016Updated 9 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- Movielens collaborative filtering with Solr streaming expression☆11Oct 13, 2016Updated 9 years ago
- Recommendation engine and it's algorithms in python , R .☆12Oct 26, 2018Updated 7 years ago
- Monorepo/Yarn Workspaces/Lerna with Create React App in TypeScript that produces sourcemap for VSCode debugging and Sentry reports☆11Apr 7, 2023Updated 3 years ago
- ArXiv'18 implementation of amortized maximum likelihood (AML) for high-quality, weakly-supervised shape completion.☆11Nov 30, 2018Updated 7 years ago
- The code for our submission in Kaggle's competition Quora Question Pairs which ranked in the top 25%.☆30May 10, 2020Updated 5 years ago