tansey / rl-tictactoeView external linksLinks
A reinforcement learning agent for tic-tac-toe. Implements the example from Chapter 1 of Sutton and Barto.
☆50Jun 6, 2018Updated 7 years ago
Alternatives and similar repositories for rl-tictactoe
Users that are interested in rl-tictactoe are comparing it to the libraries listed below
Sorting:
- Machine Learning Hackathon organized by Hackerearth☆13Feb 2, 2016Updated 10 years ago
- Theano implementation of the Neural GPU☆15Jan 5, 2016Updated 10 years ago
- Implementation of the classic game of Tic-Tac-Toe using Reinforcement Learning☆14Feb 9, 2014Updated 12 years ago
- ☆22Aug 18, 2020Updated 5 years ago
- ☆22Mar 11, 2016Updated 9 years ago
- Theano port of Ross Girshick's ROI Pooling Layer☆28Oct 19, 2016Updated 9 years ago
- This GUI Program helps you download songs from Spotify.☆10Dec 16, 2021Updated 4 years ago
- Information relating to topics on Data Engineering, Data Infrastructure, Data Storing, Data Warehouses and Business Analysis. For those i…☆10Aug 8, 2021Updated 4 years ago
- Trading algorithm for Bitcoins in USD on quantconnect.com☆13Jan 12, 2018Updated 8 years ago
- A short dirty python script to ping a Flight Comparison website and send an email notifying if a maximum price (or below) is available.☆12Oct 17, 2012Updated 13 years ago
- Machine Translation Evaluation Metric☆39Dec 6, 2017Updated 8 years ago
- Extract annotated misspellings from MIMIC-III.☆13Dec 17, 2020Updated 5 years ago
- ☆10Apr 4, 2018Updated 7 years ago
- Predictable Feature Analysis☆10Dec 1, 2014Updated 11 years ago
- Implementation of multi-armed bandits in Julia☆12Jan 12, 2020Updated 6 years ago
- Physical Bitcoin Ticker☆13Mar 14, 2016Updated 9 years ago
- forwarding Outlook emails to Telegram. Python/win32com☆12Sep 15, 2016Updated 9 years ago
- Check Luxmed doctor appointment availability.☆11Dec 9, 2018Updated 7 years ago
- Flow-based programming framework☆15Apr 9, 2018Updated 7 years ago
- Tools for automated grading of python assignments.☆10Jul 6, 2019Updated 6 years ago
- Dynamic dispatch over arbitrary predicates☆10Feb 2, 2016Updated 10 years ago
- Codes related to Lord of the Machines hackathon☆10Apr 25, 2018Updated 7 years ago
- This is a machine learning challenge conducted by C&D Labs and Future Group in association with HackerEarth.☆10Nov 17, 2017Updated 8 years ago
- This is a very fast parsing script for downloaded TV shows and movies. It will use scene-standard naming conventions (and a lot of nonsta…☆16Oct 30, 2017Updated 8 years ago
- Small, lightweight UI for private Docker registries. With features for image description and run documentation.☆10Feb 27, 2019Updated 6 years ago
- A very simple 1D Kalman Filter in MATLAB (for teaching)☆14Jan 3, 2017Updated 9 years ago
- Repo for the code used during our Beginner Track: Intro to ML workshop series☆13Oct 2, 2018Updated 7 years ago
- ☆12May 22, 2016Updated 9 years ago
- Cryptocurrency arbitrage bot that buys ETH with BTC at Kraken, transfers the ETH from Kraken to QuadrigaCX, sells the ETH for BTC at Quad…☆10May 25, 2018Updated 7 years ago
- A simple model for classifying papers by academic venue (AI/ML/ACL), given a title and abstract. Bare-metal PyTorch port of https://gith…☆12Mar 22, 2018Updated 7 years ago
- A Playground for Variational Autoencoders☆12Feb 11, 2018Updated 8 years ago
- ☆11Sep 8, 2017Updated 8 years ago
- récriture inclusive des textes en ligne☆15Dec 19, 2022Updated 3 years ago
- Latex resume template☆12Mar 29, 2012Updated 13 years ago
- Code for the Adzuna Salary Prediction Kaggle competition - http://www.kaggle.com/c/job-salary-prediction Placed 10th out of approximately…☆11Apr 10, 2013Updated 12 years ago
- Twitter text processing library (auto linking and extraction of usernames, lists and hashtags). Based on the Java implementation by Matt …☆88Jul 28, 2014Updated 11 years ago
- ☆12Dec 8, 2022Updated 3 years ago
- ☆10Nov 10, 2016Updated 9 years ago
- Implementation of the X-armed Bandits algorithm, as detailed in the paper, "X-armed Bandits", Bubeck et al., 2011.☆11Jul 12, 2018Updated 7 years ago