Single player Alpha Zero implementation
☆42Mar 7, 2022Updated 4 years ago
Alternatives and similar repositories for alphazero_singleplayer
Users that are interested in alphazero_singleplayer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AlphaZero for continuous control tasks☆23Dec 7, 2022Updated 3 years ago
- ☆16Feb 1, 2022Updated 4 years ago
- An illustration program which visualizes the MCTS mechanism inside AlphaZero in order to provide a better understanding of how an AI make…☆19Aug 6, 2018Updated 7 years ago
- Learning bisimulation metrics for control, particularly suited to sparse reward settings☆11Feb 28, 2023Updated 3 years ago
- List of awesome JAX resources☆13Dec 8, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆10Jul 4, 2022Updated 3 years ago
- Code for the Hamiltonian Variational Auto-Encoder from the proceedings of NeurIPS 2018☆16Oct 2, 2019Updated 6 years ago
- Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and obse…☆19Jan 24, 2023Updated 3 years ago
- ☆77Mar 23, 2026Updated 2 months ago
- Monte Carlo Tree Search for Markov decision processes using the POMDPs.jl framework☆81Nov 16, 2025Updated 6 months ago
- ☆11Apr 8, 2016Updated 10 years ago
- ☆14Oct 27, 2023Updated 2 years ago
- A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.☆15Feb 17, 2020Updated 6 years ago
- A Python library for controlling AlphaDog robotic dogs.☆12Apr 16, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation☆18Sep 2, 2024Updated last year
- ☆12Apr 17, 2023Updated 3 years ago
- GMG, Poisson solver and Lid cavity☆12Sep 8, 2025Updated 9 months ago
- Note: "Deep Reinforcement Learning: An Overview"☆12Mar 26, 2018Updated 8 years ago
- JPEG-LM: LLMs as Image Generators with Canonical Codec Representations☆15Sep 29, 2024Updated last year
- Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs☆15Jun 20, 2016Updated 9 years ago
- ☆32Jun 10, 2025Updated last year
- Reward Propagation using Graph Convolutional Networks☆13Jun 19, 2021Updated 4 years ago
- ☆14Jul 21, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- MultiModal Rag with Colpali, Milvus and VLM☆15Dec 22, 2024Updated last year
- ☆27Feb 24, 2024Updated 2 years ago
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆14Jul 16, 2024Updated last year
- Classic MCTS example with mctx☆25May 25, 2023Updated 3 years ago
- ☆20Dec 8, 2022Updated 3 years ago
- env for gym, match3 game☆11Jun 2, 2019Updated 7 years ago
- An reimplement of liif(Learning Continuous Image Representation with Local Implicit Image Function) using lightning+hydra☆11Mar 26, 2021Updated 5 years ago
- Code release for the paper "Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control"☆17Apr 9, 2024Updated 2 years ago
- ☆14Aug 16, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Google Pacman clone for Android☆18Jan 21, 2023Updated 3 years ago
- Python scripts to facilitate easy working☆11Mar 23, 2026Updated 2 months ago
- An AI agent that use Double Deep Q-learning to teach itself to land a Lunar Lander on OpenAI universe☆17Mar 15, 2021Updated 5 years ago
- A HMM application in Kritzman Regime Detection☆15Jan 3, 2020Updated 6 years ago
- A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more☆4,460Jan 1, 2025Updated last year
- Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding (RSS 2021)☆12Oct 22, 2021Updated 4 years ago
- Reinforcement learning modular with pytorch☆11Jan 18, 2021Updated 5 years ago