This is an implementation of the tic-tac-toe game as a Gym environment. It can be used to train an agent to play tic-tac-toe.
☆26 · Jan 6, 2019 · Updated 7 years ago
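To illustrate what "tic-tac-toe as a Gym environment" can look like, here is a minimal sketch of an environment with a Gym-style `reset`/`step` interface. It is not the repository's actual code: the class name `TicTacToeEnv`, the board encoding, the reward values, and the helper `play_random_episode` are all assumptions made for this example, and the sketch deliberately avoids importing `gym` so that it runs with the standard library alone.

```python
import random


class TicTacToeEnv:
    """Hypothetical tic-tac-toe environment with a Gym-style interface.

    Assumed encoding: the board is 9 cells, 0 = empty, 1 = X, -1 = O.
    Assumed rewards (from the view of the player who just moved):
    +1.0 for a win, 0.0 otherwise, -1.0 for an illegal move.
    """

    WIN_LINES = [
        (0, 1, 2), (3, 4, 5), (6, 7, 8),  # rows
        (0, 3, 6), (1, 4, 7), (2, 5, 8),  # columns
        (0, 4, 8), (2, 4, 6),             # diagonals
    ]

    def __init__(self):
        self.reset()

    def reset(self):
        """Start a new game and return the initial observation."""
        self.board = [0] * 9
        self.player = 1  # X moves first
        self.done = False
        return tuple(self.board)

    def legal_actions(self):
        """Indices of the empty cells."""
        return [i for i, v in enumerate(self.board) if v == 0]

    def _winner(self):
        """Return 1 or -1 if that player has three in a row, else 0."""
        for a, b, c in self.WIN_LINES:
            if abs(self.board[a] + self.board[b] + self.board[c]) == 3:
                return self.board[a]
        return 0

    def step(self, action):
        """Place the current player's mark; return (obs, reward, done, info)."""
        if self.done:
            raise RuntimeError("episode is over; call reset() first")
        if self.board[action] != 0:
            # Assumption: an illegal move ends the episode with a penalty.
            self.done = True
            return tuple(self.board), -1.0, True, {"winner": 0, "illegal": True}
        self.board[action] = self.player
        winner = self._winner()
        reward = 1.0 if winner != 0 else 0.0
        self.done = winner != 0 or not self.legal_actions()
        if not self.done:
            self.player = -self.player  # hand the turn to the other player
        return tuple(self.board), reward, self.done, {"winner": winner, "illegal": False}


def play_random_episode(env, rng):
    """Play one game with uniformly random legal moves for both sides."""
    env.reset()
    done = False
    while not done:
        obs, reward, done, info = env.step(rng.choice(env.legal_actions()))
    return obs, reward, info
```

A learning agent would replace the random choice in `play_random_episode` with its own policy, for example `obs, reward, info = play_random_episode(TicTacToeEnv(), random.Random(0))` plays a single random game.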
Alternatives and similar repositories for Tic-Tac-Toe-Gym_Environment
Users that are interested in Tic-Tac-Toe-Gym_Environment are comparing it to the libraries listed below.
- Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/) ☆19 · Jul 20, 2018 · Updated 7 years ago
- Solving MuJoCo environments with Deep Deterministic Policy Gradients ☆14 · Sep 17, 2018 · Updated 7 years ago
- PyTorch Implementation of “Unsupervised learning by competing hidden units” MNIST classifier ☆12 · May 6, 2019 · Updated 7 years ago
- ☆11 · Dec 16, 2025 · Updated 5 months ago
- JavaScript/Node.js sentence similarity. Produces several measures of similarity based on fuzzy matching and a similarity matrix. ☆22 · Jan 9, 2023 · Updated 3 years ago
- Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncat… ☆11 · Apr 3, 2019 · Updated 7 years ago
- Dynamic Spectrotemporal Receptive Field (dSTRF) Analysis Toolbox ☆21 · Jun 16, 2022 · Updated 3 years ago
- Desktop weather forecast (based on Qt5, with a clear code structure and detailed comments) ☆11 · Jul 29, 2023 · Updated 2 years ago
- Documentation of the Two!Ears Auditory Model ☆13 · Feb 14, 2019 · Updated 7 years ago
- Mujoco Models for the Fetch Robot ☆32 · Feb 9, 2025 · Updated last year
- The model code for the Verhulst, Altoè, Vasilkov 2018 Hearing Research publication ☆32 · Nov 5, 2025 · Updated 6 months ago
- Proximal Policy Optimization with Stein Control Variates ☆33 · Feb 12, 2018 · Updated 8 years ago
- LUNA: a Framework for Language Understanding and Naturalness Assessment. ☆12 · Sep 9, 2023 · Updated 2 years ago
- [experiment] CRF-based disambiguation engine for pymorphy2 ☆10 · May 9, 2016 · Updated 10 years ago
- Sawyer environments for reinforcement learning using the OpenAI Gym interface (EXPERIMENTAL) ☆37 · Dec 11, 2019 · Updated 6 years ago
- Underwater Communication & Navigation Laboratory documentation site ☆13 · Updated this week
- target speaker verification (tSV), ts-vector, universal speaker verification for single- and multi-talker speech ☆15 · Jan 26, 2021 · Updated 5 years ago
- Implementation of Mean Field Multi-Agent Reinforcement Learning in Pytorch ☆22 · Apr 27, 2024 · Updated 2 years ago
- Deep reinforcement learning in autonomous driving ☆12 · Aug 25, 2021 · Updated 4 years ago
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data" ☆14 · Feb 16, 2021 · Updated 5 years ago
- Setup generator for the board game Spirit Island 🏝️ ☆10 · Nov 24, 2023 · Updated 2 years ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models" ☆14 · Aug 19, 2022 · Updated 3 years ago
- Highly Modular and Scalable Reinforcement Learning ☆117 · Jan 14, 2020 · Updated 6 years ago
- System for automatic pronominal resolution for Russian ☆14 · Apr 3, 2020 · Updated 6 years ago
- A C++ library for working with OWL2 ontologies. ☆12 · Jan 26, 2016 · Updated 10 years ago
- Lightweight multiple sound source localization, based on a triangular microphone array. ☆16 · Dec 5, 2023 · Updated 2 years ago
- Simple bit flipping with sparse rewards using HER, similarly to the original paper ☆39 · Feb 25, 2019 · Updated 7 years ago
- Research Project on Multi-robot Target Tracking via Deep Reinforcement Learning ☆21 · Dec 17, 2020 · Updated 5 years ago
- ☆12 · Apr 4, 2023 · Updated 3 years ago
- DRL for Dynamic Vehicle Routing Problem with stochastic customer requests ☆19 · Aug 28, 2023 · Updated 2 years ago
- A tiny Python 2.7 script which converts LaTeX projects into arXiv format. Suggestions are welcome. ☆10 · Mar 20, 2016 · Updated 10 years ago
- Official repository for paper "Goal-Aware Neural SAT Solver" ☆17 · Jun 10, 2023 · Updated 2 years ago
- ☆46 · Oct 28, 2025 · Updated 6 months ago
- Implementation of paper Long-Term Effect Estimation with Surrogate Representation ☆14 · Oct 20, 2020 · Updated 5 years ago
- ☆11 · Jan 25, 2023 · Updated 3 years ago
- Codenames AI ☆12 · Jun 21, 2022 · Updated 3 years ago
- ☆11 · Dec 2, 2018 · Updated 7 years ago
- A multi-language segmenter using high-order CRF. ☆17 · Feb 27, 2020 · Updated 6 years ago
- ☆11 · Oct 9, 2021 · Updated 4 years ago