This is an implementation of the tic-tac-toe game as a gym environment. It can be used to make the computer learn playing the Tic-Tac-Toe game.
☆26Jan 6, 2019Updated 7 years ago
Alternatives and similar repositories for Tic-Tac-Toe-Gym_Environment
Users that are interested in Tic-Tac-Toe-Gym_Environment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Step-By-Step tutorial to build and deploy an image classification API☆15Nov 21, 2022Updated 3 years ago
- Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/)☆19Jul 20, 2018Updated 7 years ago
- MuJoCo Models for Personal Robot 2 (PR2)☆11Aug 25, 2018Updated 7 years ago
- Solving MuJoCo environments with Deep Deterministic Policy Gradients☆14Sep 17, 2018Updated 7 years ago
- My homework solutions for UC Berkeley CS294: deep unsupervised learning☆14Mar 24, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Spelling Correction with Bidirectional LSTM + Attention Mechanism.☆11Oct 16, 2021Updated 4 years ago
- PyTorch Implementation of “Unsupervised learning by competing hidden units” MNIST classifier☆12May 6, 2019Updated 7 years ago
- Learning Long-Horizon Robot Exploration Strategies for Multi-Object Search in Continuous Action Spaces. http://multi-object-search.cs.uni…☆13Nov 29, 2022Updated 3 years ago
- Differential game theory for multi-agent collision avoidance. Simulations set up.☆12Jan 27, 2021Updated 5 years ago
- ☆11Dec 16, 2025Updated 5 months ago
- Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncat…☆11Apr 3, 2019Updated 7 years ago
- Reproduction of the paper SFSRNet: Super-resolution for single-channel Audio Source Separation by me (@arda-num) and @dritx16. Navigate P…☆11Jul 7, 2022Updated 3 years ago
- Dynamic Spectrotemporal Receptive Field (dSTRF) Analysis Toolbox☆21Jun 16, 2022Updated 3 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This project offers to solve Multi-Agent-Path-Finding(MAPF) problem optimally using Conflict-Based Search(CBS).☆14Aug 31, 2022Updated 3 years ago
- ☆13Nov 14, 2022Updated 3 years ago
- Official Implementation of MFG-RGBT-Tracking with PyTorch☆15Aug 10, 2020Updated 5 years ago
- Topics of conferences☆12Jul 12, 2016Updated 9 years ago
- collection with description of super-resolution related papers, repositories, datasets, loss functions and etc.☆11Dec 12, 2023Updated 2 years ago
- A set of methods for finding an appropriate number of topics in a text collection☆15Apr 13, 2026Updated last month
- LUNA: a Framework for Language Understanding and Naturalness Assessment.☆12Sep 9, 2023Updated 2 years ago
- The goal of this project is to use scanning lidar to create a map which will enable autonomous navigation of a simple robot☆14Jun 27, 2021Updated 4 years ago
- Analysis of Russian mass media articles about internet regulation with structural topic modeling☆11May 15, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Solving rectangle-fit problem using meta-heuristic approaches☆12Jan 20, 2017Updated 9 years ago
- Program used in conjunction with Autodesk Inventor 2013 to convert assemblies into Universal Robot Description Format (URDF) for ROS.☆34Jun 19, 2017Updated 8 years ago
- This repository contains implementation of A2C with GAE, which is used to control robot in MuJoCo environment.☆10Jan 6, 2020Updated 6 years ago
- Sawyer environments for reinforcement learning using the OpenAI Gym interface (EXPERIMENTAL)☆37Dec 11, 2019Updated 6 years ago
- target speaker verification (tSV), ts-vector, universal speaker verification for single- and multi-talker speech☆15Jan 26, 2021Updated 5 years ago
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data"☆14Feb 16, 2021Updated 5 years ago
- Setup generator for the board game Spirit Island 🏝️☆10Nov 24, 2023Updated 2 years ago
- DEREK (Domain Entities and Relations Extraction Kit)☆10May 22, 2023Updated 3 years ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"☆14Aug 19, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Legate Hello World Pedagogical Library☆10Apr 5, 2023Updated 3 years ago
- System for automatic pronominal resolution for Russian☆13Apr 3, 2020Updated 6 years ago
- Extended Kalman filter for attitude estimation on a multi-IMU configuration☆13Sep 2, 2022Updated 3 years ago
- Research Project on Multi-robot Target Tracking via Deep Reinforcement Learning☆21Dec 17, 2020Updated 5 years ago
- DRL for Dynamic Vehicle Routing Problem with stochastic customer requests☆19Aug 28, 2023Updated 2 years ago
- ☆15Sep 6, 2021Updated 4 years ago
- This project showcases a comprehensive analysis of CO2 emissions in a fictitious cheese manufacturing supply chain using both graph datab…☆11Sep 18, 2024Updated last year