A reinforcement learning agent for tic-tac-toe. Implements the example from Chapter 1 of Sutton and Barto.
☆50Jun 6, 2018Updated 7 years ago
Alternatives and similar repositories for rl-tictactoe
Users that are interested in rl-tictactoe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the classic game of Tic-Tac-Toe using Reinforcement Learning☆14Feb 9, 2014Updated 12 years ago
- Based on Thompson sampling with the online bootstrap (Dean Eckles, Maurits Kaptein). http://arxiv.org/abs/1410.4009☆11Dec 30, 2014Updated 11 years ago
- Python interface for the Berkeley Parser using JPype☆12Dec 18, 2015Updated 10 years ago
- Variational Factorization Machines☆17Dec 20, 2016Updated 9 years ago
- Machine Learning Hackathon organized by Hackerearth☆13Feb 2, 2016Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆12Jul 19, 2019Updated 6 years ago
- Theano implementation of the Neural GPU☆15Jan 5, 2016Updated 10 years ago
- An experimental example of how to use OpenGL for physical simulations. All the simulation runs concurrently in the GPU using my own engin…☆33Sep 15, 2014Updated 11 years ago
- C++ training and testing code for an SVM using Vlfeat fisher vectors together with possible other features.☆14Jun 29, 2016Updated 9 years ago
- load word embeddings to Torch.Tensor☆14May 12, 2016Updated 9 years ago
- For training very deep networks☆10Jun 12, 2017Updated 8 years ago
- Just a simple use example of the conv2d_transpose function in TensorFlow. Its run on MNIST.☆23Apr 23, 2016Updated 9 years ago
- Implementation of algorithms from AIMA (Artificial Intelligence: A Modern Approach) in Python☆15Aug 10, 2011Updated 14 years ago
- hierarchical Q-learning implementation☆11Jun 9, 2015Updated 10 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Poincaré Embeddings for Learning Hierarchical Representations (https://arxiv.org/abs/1705.08039) in PyTorch☆15Dec 20, 2017Updated 8 years ago
- Deep Learning Tutorial notes and code. See the wiki for more info.☆12Nov 11, 2014Updated 11 years ago
- This GUI Program helps you download songs from Spotify.☆10Dec 16, 2021Updated 4 years ago
- The implementation of "Neural Networks for Open Domain Targeted Sentiment" based on package https://github.com/SUTDNLP/LibN3L☆18Dec 7, 2015Updated 10 years ago
- Temporary repository for implementing tensor factorization algorithms on Apache Spark☆13Nov 27, 2017Updated 8 years ago
- Coach compensation calculator using Vue and d3☆10Jan 3, 2023Updated 3 years ago
- Imagenie - Smart text over images☆18Jun 7, 2016Updated 9 years ago
- Latex resume template☆12Mar 29, 2012Updated 14 years ago
- 4th Place Solution for The Hunt for Prohibited Content Competition on Kaggle (http://www.kaggle.com/c/avito-prohibited-content)☆28Sep 1, 2014Updated 11 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A Python binding for libpuzzle.☆45Sep 8, 2020Updated 5 years ago
- Lightweight Cryptocurrency Monitor☆15Feb 14, 2019Updated 7 years ago
- Twitter text processing library (auto linking and extraction of usernames, lists and hashtags). Based on the Java implementation by Matt …☆88Jul 28, 2014Updated 11 years ago
- This is a very fast parsing script for downloaded TV shows and movies. It will use scene-standard naming conventions (and a lot of nonsta…☆16Oct 30, 2017Updated 8 years ago
- 微信小程序wx对象的API,promise化☆12Apr 8, 2019Updated 7 years ago
- A moment-free estimator of the Sharpe (signal-to-noise) ratio.☆12Dec 27, 2022Updated 3 years ago
- Dynamic dispatch over arbitrary predicates☆10Feb 2, 2016Updated 10 years ago
- A CUDA-enabled SIFT library☆15Apr 2, 2016Updated 10 years ago
- Simple reinforcement learning in Python.☆202Feb 11, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Theano port of Ross Girshick's ROI Pooling Layer☆28Oct 19, 2016Updated 9 years ago
- Fisher vectors for video classification☆21May 7, 2018Updated 7 years ago
- Speed you SHA. A different hash style.☆13Jun 13, 2016Updated 9 years ago
- ☆10Apr 4, 2018Updated 8 years ago
- Implementation of "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"☆11Nov 7, 2015Updated 10 years ago
- trailing stop loss daemon that tracks performance via Philips Hue☆11Apr 24, 2018Updated 7 years ago
- ☆22Mar 11, 2016Updated 10 years ago