(Keras) Use deep Q-learning to build two Gomoku (Five-in-a-Row) agents playing against each other.
☆19Oct 8, 2016Updated 9 years ago
Alternatives and similar repositories for Reinforcement_Learning_Project
Users that are interested in Reinforcement_Learning_Project are comparing it to the libraries listed below
Sorting:
- Tensorflow implementation for "Noisy network for exploration"☆19Aug 2, 2017Updated 8 years ago
- Some code for tutorials following https://gym.openai.com/docs/rl☆14Jul 3, 2016Updated 9 years ago
- Chapter 15 AlphaZero in book Deep Reinforcement Learning: code example of AlphaZero solving Gomoku game.☆36Feb 18, 2020Updated 6 years ago
- Here are some Python implementations of Gomoku AIs, including MCTS, Minimax and Genetic Alg.☆33Dec 14, 2018Updated 7 years ago
- a Renju game, replicate paper "Mastering the game of Go with deep neural networks and tree search"☆20Jun 29, 2016Updated 9 years ago
- 3D learning environment with rigid body simulation for Linux/MacOSX☆14Dec 24, 2021Updated 4 years ago
- Tensorflow Implementation for "Noisy network for exploration"☆31Jul 17, 2017Updated 8 years ago
- in progress☆61Feb 19, 2016Updated 10 years ago
- Vehicle detection based on YOLO and SVM☆15Jan 29, 2018Updated 8 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆133May 5, 2019Updated 6 years ago
- Tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆13Dec 23, 2016Updated 9 years ago
- A short hands-on of CNN using Stanford CS231n online material☆17Oct 23, 2017Updated 8 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆57Aug 25, 2017Updated 8 years ago
- AlphaZero implementation on Gomoku☆18Feb 26, 2025Updated last year
- A Swift Wrapper for PyTorch and Torchvision.☆14Jul 19, 2019Updated 6 years ago
- General Python implementation of Monte Carlo Tree Search for the use with Open AI Gym environments.☆42Oct 8, 2020Updated 5 years ago
- ☆12Sep 23, 2020Updated 5 years ago
- A modified Alphazero implementation with C++ where performance matters.☆18Mar 7, 2026Updated 2 weeks ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Dec 1, 2019Updated 6 years ago
- Modification of SOMPY repo with robust K-means clustering (bootstrapped SSE elbow method)☆13Apr 6, 2019Updated 6 years ago
- Yelp Restaurant Photo Classification - Kaggle competition☆12Apr 19, 2019Updated 6 years ago
- Reinforcement Learning Assembly☆92Sep 2, 2021Updated 4 years ago
- Non official torchnet package for vision☆20Feb 4, 2017Updated 9 years ago
- ☆14Apr 14, 2025Updated 11 months ago
- Finalist entry for the M2CAI Workflow Challenge 2016☆10Nov 25, 2016Updated 9 years ago
- Movielens collaborative filtering with Solr streaming expression☆11Oct 13, 2016Updated 9 years ago
- Recommendation engine and it's algorithms in python , R .☆12Oct 26, 2018Updated 7 years ago
- ArXiv'18 implementation of amortized maximum likelihood (AML) for high-quality, weakly-supervised shape completion.☆11Nov 30, 2018Updated 7 years ago
- Multi-TransSP for MICCAI2022☆18Jun 20, 2022Updated 3 years ago
- ☆30Oct 20, 2015Updated 10 years ago
- Machine learning in nim☆12Aug 16, 2014Updated 11 years ago
- A framework for Awesome WM config☆10Aug 22, 2021Updated 4 years ago
- ☆11Jan 23, 2017Updated 9 years ago
- Recurrent Convolutional Memory Network (in progress)☆29Apr 16, 2016Updated 9 years ago
- Loitor VI Sensor Documents☆12Jun 15, 2017Updated 8 years ago
- Optical flow with convolutional neural networks for vision-based guidance of UAS☆11Aug 23, 2017Updated 8 years ago
- Deep reinforcement learning in ViZDoom (using Tensorflow)☆19Jan 25, 2018Updated 8 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- Provides Movie Recommendations on the MovieLens ml-100k dataset using Collaborative Filtering☆11Nov 14, 2013Updated 12 years ago