(Keras) Use deep Q-learning to build two Gomoku (Five-in-a-Row) agents playing against each other.
☆19Oct 8, 2016Updated 9 years ago
Alternatives and similar repositories for Reinforcement_Learning_Project
Users that are interested in Reinforcement_Learning_Project are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tensorflow implementation for "Noisy network for exploration"☆19Aug 2, 2017Updated 8 years ago
- An illustration program which visualizes the MCTS mechanism inside AlphaZero in order to provide a better understanding of how an AI make…☆19Aug 6, 2018Updated 7 years ago
- Some code for tutorials following https://gym.openai.com/docs/rl☆15Jul 3, 2016Updated 9 years ago
- Chapter 15 AlphaZero in book Deep Reinforcement Learning: code example of AlphaZero solving Gomoku game.☆36Feb 18, 2020Updated 6 years ago
- paper <<Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation>> python implementation☆10Mar 27, 2018Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Graph Convolutional Neural Networks for Alzheimer’s Classification with transfer learning and HPC methods☆12Sep 20, 2021Updated 4 years ago
- Official implementation of paper "An objective quantitative diagnosis of depression using a local-to-global multi-modal fusion graph neur…☆14Jan 13, 2025Updated last year
- Here are some Python implementations of Gomoku AIs, including MCTS, Minimax and Genetic Alg.☆33Dec 14, 2018Updated 7 years ago
- 3D learning environment with rigid body simulation for Linux/MacOSX☆14Dec 24, 2021Updated 4 years ago
- a Renju game, replicate paper "Mastering the game of Go with deep neural networks and tree search"☆20Jun 29, 2016Updated 9 years ago
- Simple JS webapp for drawing bezier curves☆16Dec 19, 2017Updated 8 years ago
- Tensorflow Implementation for "Noisy network for exploration"☆31Jul 17, 2017Updated 8 years ago
- Domain Adaptation with Randomized Expectation Maximization☆14Jan 16, 2019Updated 7 years ago
- in progress☆60Feb 19, 2016Updated 10 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A python implemenation of tabular MuZero for educational purposes☆21Dec 11, 2019Updated 6 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆133May 5, 2019Updated 7 years ago
- Interactive Multi-Agent Reinforcement Learning Environment for the board game Gobblet using PettingZoo.☆12Jul 2, 2023Updated 2 years ago
- Tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆13Dec 23, 2016Updated 9 years ago
- A short hands-on of CNN using Stanford CS231n online material☆17Oct 23, 2017Updated 8 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆57Aug 25, 2017Updated 8 years ago
- AlphaZero implementation on Gomoku☆18Feb 26, 2025Updated last year
- Minimax with alpha-beta pruning. Bitmasking and WebWorkers for performance.☆13Nov 3, 2021Updated 4 years ago
- Python binding to the levmar library using Cython☆20Mar 18, 2020Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ORB-SLAM2-IMU-VIO 直接法加速的惯导加持的ORB-SLAM2☆11Nov 21, 2018Updated 7 years ago
- General Python implementation of Monte Carlo Tree Search for the use with Open AI Gym environments.☆42Oct 8, 2020Updated 5 years ago
- ☆12Sep 23, 2020Updated 5 years ago
- ☆13Jul 4, 2017Updated 8 years ago
- Various dotfiles I use on my machines.☆19Jun 2, 2026Updated last week
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Dec 1, 2019Updated 6 years ago
- Modification of SOMPY repo with robust K-means clustering (bootstrapped SSE elbow method)☆13Apr 6, 2019Updated 7 years ago
- Yelp Restaurant Photo Classification - Kaggle competition☆11Apr 19, 2019Updated 7 years ago
- Non official torchnet package for vision☆20Feb 4, 2017Updated 9 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆10Nov 19, 2015Updated 10 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- Movielens collaborative filtering with Solr streaming expression☆10Oct 13, 2016Updated 9 years ago
- Recommendation engine and it's algorithms in python , R .☆12Oct 26, 2018Updated 7 years ago
- Mac port of Torcs, The Open Racing Car Simulator☆11Jun 16, 2010Updated 15 years ago
- Reinforcement learning in 3D.☆21Mar 29, 2017Updated 9 years ago
- Targetprocess 3 Mashup Library☆26Sep 4, 2025Updated 9 months ago