An implementation of AlphaZero and MCTS with neural networks for Tetris
☆22Mar 21, 2025Updated last year
Alternatives and similar repositories for alphazero-tetris
Users that are interested in alphazero-tetris are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Atari-style POMDPs☆28May 13, 2026Updated 2 weeks ago
- An agent for playing Atari games running on a Teensy microcontroller☆15Nov 11, 2022Updated 3 years ago
- Benchmark for evaluating the generalization capabilities of Multi-Objective Reinforcement Learning (MORL) algorithms.☆27Jun 6, 2025Updated 11 months ago
- ☆12Oct 19, 2023Updated 2 years ago
- Classic MCTS example with mctx☆25May 25, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Reinforcement Learning example in Nim, playing tic tac toe. Based off original C version from the great Antirez☆15Apr 2, 2025Updated last year
- A blog for LLVM(v11.0.0) beginner, step by step, with detailed documents and comments. Record the way I learn LLVM.☆14Jun 17, 2022Updated 3 years ago
- Develop your agent for generals.io!☆87Updated this week
- ☆40Feb 14, 2026Updated 3 months ago
- Unofficial Implementation of Null-text Inversion (https://arxiv.org/abs/2211.09794)☆12Nov 20, 2022Updated 3 years ago
- A Pytorch Lightning WGAN-gp to generate faces☆11Jan 26, 2021Updated 5 years ago
- Getting Started in Imitation Learning☆13Mar 3, 2025Updated last year
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆18Nov 24, 2025Updated 6 months ago
- Official Implementation of SFM and the baselines in Jax.☆21May 31, 2025Updated 11 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Concept Learning Dynamics☆16Oct 29, 2024Updated last year
- code for the paper Imitation Learning from Observation with Automatic Discount Scheduling☆13Mar 27, 2024Updated 2 years ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆28Jan 14, 2025Updated last year
- Collection of resources on plasticity loss in deep reinforcement learning☆23Nov 12, 2024Updated last year
- Pointax: PointMaze Environment for JAX☆28Oct 22, 2025Updated 7 months ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Jul 21, 2023Updated 2 years ago
- PyTorch Implementation of Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model☆28Oct 10, 2024Updated last year
- Tutorial kit for building a 3D deep reinforcement learning environment with Unity ML-Agents.☆11Oct 22, 2021Updated 4 years ago
- A minimal, single-file implementation of the MeanFlow paper on 2D toy examples, with a side-by-side comparison to rectified flow.☆27Jul 4, 2025Updated 10 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Minimal JAX implementation unifying Diffusion and Flow Matching algorithms as alternative strategies for transporting data distributions.☆66Dec 19, 2025Updated 5 months ago
- Ray Tracer written in Rust☆13Nov 22, 2021Updated 4 years ago
- List of resources for understanding computers ground up☆33Jul 13, 2023Updated 2 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Discovering Quality-Diversity Algorithms via Meta-Black-Box Optimization☆23Dec 1, 2025Updated 5 months ago
- Codebase for Extracting Reward Functions from Diffusion Models☆16Dec 7, 2023Updated 2 years ago
- A worker pool library for Rust☆12May 4, 2026Updated 3 weeks ago
- A synthetic story narration dataset to study small audio LMs.☆31Jan 21, 2024Updated 2 years ago
- Reading list for research topics in Diffusion models.☆18Jan 12, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆17Nov 8, 2023Updated 2 years ago
- ☆18Apr 11, 2024Updated 2 years ago
- time-bomb.nvim is a minimal Neovim plugin for timers and Pomodoro cycles to boost developer focus. Features floating timers, 9 progress b…☆32Mar 12, 2026Updated 2 months ago
- a minimalistic todo app☆10May 10, 2023Updated 3 years ago
- An unnecessarily tiny and minimal implementation of GPT-2 in NumPy.☆11Feb 12, 2023Updated 3 years ago
- Code for the note "NF4 Isn't Information Theoretically Optimal (and that's Good)☆21Jun 22, 2023Updated 2 years ago
- Implementation of numerous Vision Transformers in Google's JAX and Flax.☆22Aug 30, 2022Updated 3 years ago