An implementation of AlphaZero and MCTS with neural networks for Tetris
☆22Mar 21, 2025Updated last year
Alternatives and similar repositories for alphazero-tetris
Users that are interested in alphazero-tetris are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Atari-style POMDPs☆25Feb 13, 2026Updated last month
- A fully configurable Gymnasium compatible Tetris environment☆44Feb 28, 2026Updated last month
- An agent for playing Atari games running on a Teensy microcontroller☆15Nov 11, 2022Updated 3 years ago
- Benchmark for evaluating the generalization capabilities of Multi-Objective Reinforcement Learning (MORL) algorithms.☆26Jun 6, 2025Updated 9 months ago
- Classic MCTS example with mctx☆24May 25, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Reinforcement Learning example in Nim, playing tic tac toe. Based off original C version from the great Antirez☆15Apr 2, 2025Updated 11 months ago
- Develop your agent for generals.io!☆77Mar 20, 2026Updated last week
- Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery☆17Jul 11, 2023Updated 2 years ago
- ☆10Sep 21, 2024Updated last year
- Unveiling the Layers: Neural Networks from first principles☆10Oct 1, 2025Updated 5 months ago
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆18Nov 24, 2025Updated 4 months ago
- Reading list for adversarial perspective and robustness in deep reinforcement learning.☆131Mar 2, 2026Updated 3 weeks ago
- Reading Group @mila-iqia on Computational Optimal Transport for Machine Learning Applications☆13Jun 3, 2022Updated 3 years ago
- Official Implementation of SFM and the baselines in Jax.☆21May 31, 2025Updated 9 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Pointax: PointMaze Environment for JAX☆26Oct 22, 2025Updated 5 months ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆27Jan 14, 2025Updated last year
- code for the paper Imitation Learning from Observation with Automatic Discount Scheduling☆13Mar 27, 2024Updated 2 years ago
- Code base for NeurIPS 2022 paper Curriculum Reinforcement Learning using Optimal Transport via Gradual Domain Adaptation.☆11Aug 21, 2023Updated 2 years ago
- Collection of resources on plasticity loss in deep reinforcement learning☆23Nov 12, 2024Updated last year
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Jul 21, 2023Updated 2 years ago
- JAX implementation of RL algorithms and vectorized environments☆51Dec 26, 2023Updated 2 years ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆238Nov 24, 2025Updated 4 months ago
- Tutorial kit for building a 3D deep reinforcement learning environment with Unity ML-Agents.☆11Oct 22, 2021Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Minimal JAX implementation unifying Diffusion and Flow Matching algorithms as alternative strategies for transporting data distributions.☆63Dec 19, 2025Updated 3 months ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Generative Modeling via Drifting in MLX☆42Feb 6, 2026Updated last month
- Discovering Quality-Diversity Algorithms via Meta-Black-Box Optimization☆21Dec 1, 2025Updated 3 months ago
- This is a repo covers ai research papers pseudocodes☆17Jun 20, 2023Updated 2 years ago
- Codebase for Extracting Reward Functions from Diffusion Models☆16Dec 7, 2023Updated 2 years ago
- Reading list for research topics in Diffusion models.☆18Jan 12, 2024Updated 2 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆17Oct 23, 2022Updated 3 years ago
- ☆18Nov 8, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- time-bomb.nvim is a minimal Neovim plugin for timers and Pomodoro cycles to boost developer focus. Features floating timers, 9 progress b…☆32Mar 12, 2026Updated 2 weeks ago
- a minimalistic todo app☆10May 10, 2023Updated 2 years ago
- An unnecessarily tiny and minimal implementation of GPT-2 in NumPy.☆11Feb 12, 2023Updated 3 years ago
- The code for the video tutorial series on building a Transformer from scratch: https://www.youtube.com/watch?v=XR4VDnJzB8o☆19Apr 15, 2023Updated 2 years ago
- Implementation of the ByteDance MagicMix paper☆19Nov 4, 2022Updated 3 years ago
- Implementation of numerous Vision Transformers in Google's JAX and Flax.☆22Aug 30, 2022Updated 3 years ago
- Anti exploration in offline reinforcement learning☆11May 17, 2021Updated 4 years ago