Connect Four Environment is a project for training reinforcement learning models to play the classic Connect Four game. It is compatible with OpenAI Gym / Gymnasium, includes a variety of bots and an Elo leaderboard system, and supports both FCN and CNN policies.
☆18 · Sep 18, 2023 · Updated 2 years ago
Alternatives and similar repositories for Connect-4-Gym-env-Reinforcement-learning
Users that are interested in Connect-4-Gym-env-Reinforcement-learning are comparing it to the libraries listed below.
- ACM UMAP2020 Hands-on Tutorial on Data and Algorithmic Bias in Recommender Systems ☆10 · May 23, 2021 · Updated 4 years ago
- ☆15 · Jun 1, 2023 · Updated 2 years ago
- hacking recaptcha v3 using reinforcement learning ☆15 · Feb 5, 2020 · Updated 6 years ago
- I added selfplay functionality to openai gyms ☆10 · Jan 16, 2021 · Updated 5 years ago
- ☆74 · Mar 23, 2026 · Updated last month
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation ☆17 · Sep 2, 2024 · Updated last year
- ☆12 · Apr 17, 2023 · Updated 3 years ago
- OODRobustBench: a Benchmark and Large-Scale Analysis of Adversarial Robustness under Distribution Shift. ICML 2024 and ICLRW-DMLR 2024 ☆23 · Jul 25, 2024 · Updated last year
- ☆14 · Jul 21, 2022 · Updated 3 years ago
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23. ☆16 · Aug 14, 2023 · Updated 2 years ago
- ☆21 · Jun 12, 2023 · Updated 2 years ago
- ☆13 · Jul 13, 2022 · Updated 3 years ago
- ☆17 · Jun 18, 2024 · Updated last year
- Implementation of Deepmind's Neural Episodic Control ☆59 · May 9, 2018 · Updated 7 years ago
- Open AI Gym Environment For MIMIC Dataset Sepsis Patient ☆24 · Dec 8, 2022 · Updated 3 years ago
- ☆17 · Oct 9, 2024 · Updated last year
- This is a personal library that strives to implement various MARL algorithms. The environment only integrates MPE, and the algorithm curr… ☆15 · May 22, 2025 · Updated 11 months ago
- A HMM application in Kritzman Regime Detection ☆15 · Jan 3, 2020 · Updated 6 years ago
- 🦎 Minimal Python command-line parser inspired by Facebook's Hydra. Handles and parses arbitrary arguments into dot-accessible nested dic… ☆20 · Jan 20, 2022 · Updated 4 years ago
- DICE climate integrated assessment model ☆13 · Feb 23, 2022 · Updated 4 years ago
- Evaluating Robustness of Predictive Uncertainty Estimation: Are Dirichlet-based Models Reliable ? (ICML 2021) ☆28 · Nov 28, 2022 · Updated 3 years ago
- Paper and code for Gradient Descent: The Ultimate Optimizer ☆24 · Oct 3, 2023 · Updated 2 years ago
- ☆22 · May 10, 2019 · Updated 6 years ago
- Pytorch code for the paper "The color out of space: learning self-supervised representations for Earth Observation imagery" ☆18 · Oct 26, 2021 · Updated 4 years ago
- AI can help Visualizing the Impacts of Climate Change. This is an open forum to share our work ☆19 · Jun 3, 2020 · Updated 5 years ago
- ☆28 · Oct 26, 2020 · Updated 5 years ago
- ICU-Sepsis is a lightweight, yet challenging RL environment that models the treatment of sepsis in the ICU. ☆42 · Oct 23, 2024 · Updated last year
- ☆26 · Feb 6, 2022 · Updated 4 years ago
- Safe Reinforcement Learning for Autonomous Underwater Vehicles ☆34 · Oct 7, 2024 · Updated last year
- curriculum ☆27 · Feb 7, 2023 · Updated 3 years ago
- Lecture notes of Real Analysis III in bilibili ☆28 · Jan 28, 2023 · Updated 3 years ago
- 🎾 Multi-Agent Proximal Policy Optimization approach to a competitive reinforcement learning problem ☆22 · Sep 25, 2022 · Updated 3 years ago
- A Population Based Reinforcement Learning Library based on PyTorch ☆27 · Mar 5, 2023 · Updated 3 years ago
- This is attempts trying to make a fan redo version of Ra2(Red alert 2). ☆20 · Jul 29, 2023 · Updated 2 years ago
- ☆20 · Apr 11, 2024 · Updated 2 years ago
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks. ☆35 · Oct 28, 2025 · Updated 6 months ago
- This is the official code repository for the paper "Language Agents Meet Causality -- Bridging LLMs and Causal World Models" ☆29 · May 6, 2025 · Updated last year
- Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details. ☆32 · Feb 26, 2025 · Updated last year
- Safe exploration in Markov Decision Processes ☆37 · Nov 14, 2017 · Updated 8 years ago