This is an implementation of the tic-tac-toe game as a gym environment. It can be used to make the computer learn playing the Tic-Tac-Toe game.
☆26Jan 6, 2019Updated 7 years ago
Alternatives and similar repositories for Tic-Tac-Toe-Gym_Environment
Users that are interested in Tic-Tac-Toe-Gym_Environment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/)☆19Jul 20, 2018Updated 7 years ago
- MuJoCo Models for Personal Robot 2 (PR2)☆11Aug 25, 2018Updated 7 years ago
- Solving MuJoCo environments with Deep Deterministic Policy Gradients☆14Sep 17, 2018Updated 7 years ago
- Investigation for PyDataLondon 2023 and ODSC 2023 conference comparing Pandas 2, Polars and Dask☆11Dec 7, 2023Updated 2 years ago
- ☆11Dec 16, 2025Updated 3 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Differential game theory for multi-agent collision avoidance. Simulations set up.☆12Jan 27, 2021Updated 5 years ago
- Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncat…☆11Apr 3, 2019Updated 6 years ago
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- This project offers to solve Multi-Agent-Path-Finding(MAPF) problem optimally using Conflict-Based Search(CBS).☆13Aug 31, 2022Updated 3 years ago
- A concise PyTorch implementation of Proximal Policy Optimization(PPO) solving CartPole-v0☆16Jun 11, 2020Updated 5 years ago
- Topics of conferences☆12Jul 12, 2016Updated 9 years ago
- The goal of this project is to use scanning lidar to create a map which will enable autonomous navigation of a simple robot☆14Jun 27, 2021Updated 4 years ago
- Analysis of Russian mass media articles about internet regulation with structural topic modeling☆11May 15, 2018Updated 7 years ago
- Program used in conjunction with Autodesk Inventor 2013 to convert assemblies into Universal Robot Description Format (URDF) for ROS.☆33Jun 19, 2017Updated 8 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Sawyer environments for reinforcement learning using the OpenAI Gym interface (EXPERIMENTAL)☆37Dec 11, 2019Updated 6 years ago
- target speaker verification (tSV), ts-vector, universal speaker verification for single- and multi-talker speech☆15Jan 26, 2021Updated 5 years ago
- Implementation of Mean Field Multi-Agent Reinforcement Learning in Pytorch☆20Apr 27, 2024Updated last year
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data"☆14Feb 16, 2021Updated 5 years ago
- DEREK (Domain Entities and Relations Extraction Kit)☆10May 22, 2023Updated 2 years ago
- Legate Hello World Pedagogical Library☆10Apr 5, 2023Updated 2 years ago
- CORALL (COLREGs-guided Risk Aware LLM) is a novel framework that integrates Large Language Models with real-time risk assessment for auto…☆23Feb 11, 2026Updated last month
- Highly Modular and Scalable Reinforcement Learning☆118Jan 14, 2020Updated 6 years ago
- System for automatic pronominal resolution for Russian☆14Apr 3, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Lightweight multiple sound source localization, based on a triangular microphone array.☆16Dec 5, 2023Updated 2 years ago
- Extended Kalman filter for attitude estimation on a multi-IMU configuration☆13Sep 2, 2022Updated 3 years ago
- Research Project on Multi-robot Target Tracking via Deep Reinforcement Learning☆21Dec 17, 2020Updated 5 years ago
- ☆12Apr 4, 2023Updated 2 years ago
- Official repository for paper "Goal-Aware Neural SAT Solver"☆17Jun 10, 2023Updated 2 years ago
- Computable protocol wiki☆11Mar 26, 2018Updated 8 years ago
- Software codes for running the Game-theoretic Utility Tree (GUT) algorithm for the multi-robot Pursuit-Evasion problem in the Robotarium'…☆26Jul 14, 2022Updated 3 years ago
- Implementation of paper Long-Term Effect Estimation with Surrogate Representation☆14Oct 20, 2020Updated 5 years ago
- ☆45Oct 28, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- PyMT4 - Python bindings for the Metatrader 4 trading platform Project origin By rmawatson, he didn't want to be disturbed, So don't to …☆13Aug 6, 2018Updated 7 years ago
- MultiLabel classification of cow diseases by text and symptoms recognition (NER)☆12Aug 13, 2022Updated 3 years ago
- Reinforcement Learning from Hierarchical Critics☆14Jul 30, 2020Updated 5 years ago
- Codenames AI☆12Jun 21, 2022Updated 3 years ago
- Tool kit to accelerate exploratory data analysis and data cleaning☆11Mar 22, 2021Updated 5 years ago
- ☆11Dec 2, 2018Updated 7 years ago
- A dataset for realistic evaluation of noisy label methods☆14Dec 3, 2023Updated 2 years ago