(Keras) Use deep Q-learning to build two Gomoku (Five-in-a-Row) agents playing against each other.
☆19 Oct 8, 2016 Updated 9 years ago
Alternatives and similar repositories for Reinforcement_Learning_Project
Users that are interested in Reinforcement_Learning_Project are comparing it to the libraries listed below.
- An illustration program which visualizes the MCTS mechanism inside AlphaZero in order to provide a better understanding of how an AI make… ☆19 Aug 6, 2018 Updated 7 years ago
- Some code for tutorials following https://gym.openai.com/docs/rl ☆15 Jul 3, 2016 Updated 9 years ago
- Nanjing University undergraduate thesis template ☆13 Jun 1, 2016 Updated 10 years ago
- Chapter 15 (AlphaZero) of the book Deep Reinforcement Learning: a code example of AlphaZero solving the game of Gomoku. ☆36 Feb 18, 2020 Updated 6 years ago
- Python implementation of the paper "Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation" ☆10 Mar 27, 2018 Updated 8 years ago
- Graph Convolutional Neural Networks for Alzheimer’s Classification with transfer learning and HPC methods ☆12 Sep 20, 2021 Updated 4 years ago
- Python implementations of Gomoku AIs, including MCTS, Minimax, and a genetic algorithm. ☆33 Dec 14, 2018 Updated 7 years ago
- Official implementation of paper "An objective quantitative diagnosis of depression using a local-to-global multi-modal fusion graph neur… ☆14 Jan 13, 2025 Updated last year
- 3D learning environment with rigid body simulation for Linux/MacOSX ☆14 Dec 24, 2021 Updated 4 years ago
- A Renju game that replicates the paper "Mastering the game of Go with deep neural networks and tree search" ☆20 Jun 29, 2016 Updated 10 years ago
- ☆10 Dec 25, 2019 Updated 6 years ago
- TensorFlow implementation of "Noisy Networks for Exploration" ☆31 Jul 17, 2017 Updated 8 years ago
- Domain Adaptation with Randomized Expectation Maximization ☆14 Jan 16, 2019 Updated 7 years ago
- in progress ☆60 Feb 19, 2016 Updated 10 years ago
- Vehicle detection based on YOLO and SVM ☆15 Jan 29, 2018 Updated 8 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre… ☆132 May 5, 2019 Updated 7 years ago
- Skoltech, Term1 Fall course ☆12 Oct 1, 2021 Updated 4 years ago
- TensorFlow implementation of 'Asynchronous Methods for Deep Reinforcement Learning' ☆13 Dec 23, 2016 Updated 9 years ago
- A short hands-on of CNN using Stanford CS231n online material ☆17 Oct 23, 2017 Updated 8 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning (C51-DQN). ☆57 Aug 25, 2017 Updated 8 years ago
- Combine fMRI/EEG to learn about music/auditory processing ☆16 Dec 8, 2022 Updated 3 years ago
- A modified AlphaZero implementation in C++, where performance matters. ☆19 Jun 16, 2026 Updated 2 weeks ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019) ☆33 Dec 1, 2019 Updated 6 years ago
- Yelp Restaurant Photo Classification - Kaggle competition ☆11 Apr 19, 2019 Updated 7 years ago
- Reinforcement Learning Assembly ☆94 Sep 2, 2021 Updated 4 years ago
- Non official torchnet package for vision ☆20 Feb 4, 2017 Updated 9 years ago
- ☆10 Nov 19, 2015 Updated 10 years ago
- Finalist entry for the M2CAI Workflow Challenge 2016 ☆10 Nov 25, 2016 Updated 9 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm. ☆10 Jan 12, 2015 Updated 11 years ago
- Movielens collaborative filtering with Solr streaming expression ☆10 Oct 13, 2016 Updated 9 years ago
- Adversarial Learning Based Node-Edge Graph Attention Networks for Autism Spectrum Disorder Identification ☆13 Jun 29, 2022 Updated 4 years ago
- Mac port of Torcs, The Open Racing Car Simulator ☆11 Jun 16, 2010 Updated 16 years ago
- Reinforcement learning in 3D. ☆21 Mar 29, 2017 Updated 9 years ago
- Multi-TransSP for MICCAI2022 ☆18 Jun 20, 2022 Updated 4 years ago
- Researching the forward-backward algorithm ☆11 Aug 3, 2018 Updated 7 years ago
- This is a demo of how to read a string from an Arduino serially with C in a Unix environment ☆12 Jul 3, 2012 Updated 13 years ago
- Originally cloned from puzzledqs's mpi-parallel version. This version uses MPI to enable multi-GPU processing. Also includes some useful … ☆17 Jul 21, 2017 Updated 8 years ago
- ☆15 Apr 14, 2025 Updated last year
- Sparse Learning via Efficient Projection (mirror copy of latest version in http://www.yelab.net/software/SLEP/) ☆22 Dec 16, 2015 Updated 10 years ago