Single player Alpha Zero implementation
☆42Mar 7, 2022Updated 4 years ago
Alternatives and similar repositories for alphazero_singleplayer
Users that are interested in alphazero_singleplayer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AlphaZero for continuous control tasks☆23Dec 7, 2022Updated 3 years ago
- Applying DeepMind's MuZero algorithm to the cart pole environment in gym☆22May 6, 2023Updated 3 years ago
- ☆16Feb 1, 2022Updated 4 years ago
- ☆13Apr 28, 2026Updated last week
- Learning bisimulation metrics for control, particularly suited to sparse reward settings☆10Feb 28, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆18May 17, 2019Updated 6 years ago
- Perform Model Checking and POMDP Planning from LTL specifications using POMDPs.jl☆15Aug 8, 2024Updated last year
- Code for the Hamiltonian Variational Auto-Encoder from the proceedings of NeurIPS 2018☆16Oct 2, 2019Updated 6 years ago
- ☆10Jul 4, 2022Updated 3 years ago
- ☆11Apr 8, 2016Updated 10 years ago
- ☆14Oct 27, 2023Updated 2 years ago
- Belief-state planning for POMDPs using learned approximations☆23Jan 21, 2025Updated last year
- Source code for "Taming GANs with Lookahead–Minmax", ICLR 2021.☆15Mar 28, 2021Updated 5 years ago
- JPEG-LM: LLMs as Image Generators with Canonical Codec Representations☆15Sep 29, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆108Aug 9, 2024Updated last year
- Reward Propagation using Graph Convolutional Networks☆13Jun 19, 2021Updated 4 years ago
- A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games☆185Oct 26, 2024Updated last year
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆14Jul 16, 2024Updated last year
- env for gym, match3 game☆11Jun 2, 2019Updated 6 years ago
- ☆31Apr 2, 2025Updated last year
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆27May 2, 2025Updated last year
- PyTorch implementation for our NeurIPS 2023 spotlight paper "Let the Flows Tell: Solving Graph Combinatorial Optimization Problems with G…☆67May 30, 2023Updated 2 years ago
- ☆13Aug 16, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- See https://youtube-dl.org/☆10Oct 24, 2020Updated 5 years ago
- Android SDK for the Rover platform☆11Apr 29, 2026Updated last week
- ☆17Oct 9, 2024Updated last year
- Documentation on using the built-in Python debugger, PDB.☆23Dec 8, 2022Updated 3 years ago
- An AI agent that use Double Deep Q-learning to teach itself to land a Lunar Lander on OpenAI universe☆17Mar 15, 2021Updated 5 years ago
- 🦎 Minimal Python command-line parser inspired by Facebook's Hydra. Handles and parses arbitrary arguments into dot-accessible nested dic…☆20Jan 20, 2022Updated 4 years ago
- A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more☆4,434Jan 1, 2025Updated last year
- Learning Laplacian Representations in Reinforcement Learning☆18Jan 2, 2021Updated 5 years ago
- Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding (RSS 2021)☆12Oct 22, 2021Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆28Dec 7, 2021Updated 4 years ago
- Reinforcement learning modular with pytorch☆11Jan 18, 2021Updated 5 years ago
- Code that accompanies online course about using ChatGPT for data science☆15May 9, 2023Updated 2 years ago
- Alpha Zero equipped with Transformer with various novel techniques for speedup in tree search☆28Nov 15, 2018Updated 7 years ago
- pymhlib - A Toolbox for Metaheuristics and Hybrid Optimization Methods☆31Jun 6, 2023Updated 2 years ago
- Library of common cryptographic algorithms and functions for Pony☆12Jul 16, 2025Updated 9 months ago
- FinanceGPT-B☆10Mar 26, 2024Updated 2 years ago