chiamp/muzero-cartpole

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/chiamp/muzero-cartpole)

chiamp / muzero-cartpole

Applying DeepMind's MuZero algorithm to the cart pole environment in gym

☆22

Alternatives and similar repositories for muzero-cartpole

Users that are interested in muzero-cartpole are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kenjyoung / mctx_learning_demo
View on GitHub
☆55Apr 11, 2023Updated 3 years ago
tristandeleu / jax-meta-learning
View on GitHub
A collection of meta-learning algorithms in Jax
☆24Sep 3, 2022Updated 3 years ago
mhlr / awesome-jax
View on GitHub
List of awesome JAX resources
☆13Dec 8, 2022Updated 3 years ago
JimOhman / model-based-rl
View on GitHub
Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).
☆33Aug 14, 2022Updated 3 years ago
sinairv / GridSoccerSimulator
View on GitHub
A multi-agent soccer simulator in a grid-world environment, with agents implementing different reinforcement learning algorithms
☆13Jun 4, 2017Updated 9 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
minie4 / WebTerm
View on GitHub
WebTerm is a Terminal emulator that runs in the browser. It uses v86 to create a virtual linux via WebAssembly and xterm.js as the termin…
☆17Apr 28, 2021Updated 5 years ago
NTT123 / a0-jax
View on GitHub
AlphaZero in JAX
☆82Apr 3, 2024Updated 2 years ago
BrunoKM / deep-pilco-torch
View on GitHub
Deep PILCO PyTorch Implementation
☆15Mar 25, 2023Updated 3 years ago
BastianBlokland / novus
View on GitHub
General purpose, statically typed, functional programming language
☆14May 30, 2026Updated last month
winds-line / deep-MCTS
View on GitHub
☆10May 15, 2020Updated 6 years ago
deterministic-algorithms-lab / Jax-Journey
View on GitHub
A pathway and collection of resources to learning Jax from beginning to advance.
☆11Jan 2, 2021Updated 5 years ago
MIT-REALM / dcrl
View on GitHub
Density Constrained Reinforcement Learning
☆12Mar 24, 2023Updated 3 years ago
AliaElKattan / survivalofthebestfit
View on GitHub
An interactive simulation to explain algorithmic bias.
☆13Dec 3, 2022Updated 3 years ago
fouber / goal
View on GitHub
A html5 football game
☆17Sep 16, 2014Updated 11 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
PI-PhysikInstrumente / PI_ROS_Driver
View on GitHub
ROS Driver for PI-Hexapods
☆15Aug 11, 2020Updated 5 years ago
Farama-Foundation / Procgen-Staging
View on GitHub
Procgen2: A community maintained fork of procgen
☆12Aug 25, 2022Updated 3 years ago
sanketloke / dlmultiagentsoccer
View on GitHub
Deep Reinforcement Learning for Multi Agent Soccer
☆16Dec 15, 2016Updated 9 years ago
ChangYong-Oh / HyperSphere
View on GitHub
☆19Dec 29, 2018Updated 7 years ago
n2cholas / progan-flax
View on GitHub
Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation
☆12May 24, 2021Updated 5 years ago
jlwu002 / BCL
View on GitHub
[ICML 2022] Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum
☆12Jul 15, 2022Updated 4 years ago
Carbon225 / mctx-classic
View on GitHub
Classic MCTS example with mctx
☆25May 25, 2023Updated 3 years ago
zhougroup / IDAC
View on GitHub
Implicit Distributional Actor Critic
☆11Dec 8, 2021Updated 4 years ago
yangky11 / CNN-Color2Gray
View on GitHub
An implementation of Color2Gray with convolutional neural networks
☆11Dec 23, 2015Updated 10 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
lowrollr / mctx-az
View on GitHub
Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree
☆27May 2, 2025Updated last year
alexub / jax-meta-learning
View on GitHub
Simple, extensible implementations of some meta-learning algorithms in Jax
☆11Oct 6, 2020Updated 5 years ago
IATA-Cargo / one-record-server-java
View on GitHub
This repository contains Java code for implementing a ONE Record compliant API.
☆20May 17, 2024Updated 2 years ago
alan-turing-institute / stat-fem
View on GitHub
Python tools for solving data-constrained finite element problems
☆13Nov 9, 2021Updated 4 years ago
AIwithSwift / TFWorld2019-SwiftIn3Hours
View on GitHub
☆15Oct 29, 2019Updated 6 years ago
MGoibert / Label_smoothing
View on GitHub
Code for the paper Adversarial Robustness via Adversarial Label-Smoothing
☆11Feb 5, 2020Updated 6 years ago
chaoql / CCF-AIOps-Code
View on GitHub
2024CCF国际AIOps挑战赛-赛道二（GLM4）：基于检索增强的运维知识问答挑战赛解决方案分享。
☆14Jul 5, 2024Updated 2 years ago
AranKomat / Alpha-Transformer
View on GitHub
Alpha Zero equipped with Transformer with various novel techniques for speedup in tree search
☆28Nov 15, 2018Updated 7 years ago
thevasudevgupta / speech-jax
View on GitHub
Speech in Flax/JAX
☆14Jul 11, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
JTColonel / timbre-interp
View on GitHub
Autoencoder Based Real-Time Timbre Interpolation Algorithm
☆12Aug 17, 2020Updated 5 years ago
APLA-Toolbox / PythonPDDL
View on GitHub
A dependency-free, pure-Python PDDL planning framework: parser, grounder, classical-to-SOTA planners, heuristics and a benchmarking harne…
☆33Updated this week
cloudworkflow / workflow-performance-prediction-jii
View on GitHub
☆12Jan 5, 2025Updated last year
cwfparsonson / ddls
View on GitHub
Distributed deep learning cluster simulation environment and RL-GNN resource management implementations.
☆14Feb 1, 2023Updated 3 years ago
RobinKa / jaxga
View on GitHub
Geometric Algebra package for JAX
☆63Nov 6, 2021Updated 4 years ago
effdotsh / ssbm-bot
View on GitHub
An AI that learns to play Super Smash Bros. Melee via imitation learning
☆23Feb 22, 2024Updated 2 years ago
kasimte / adversarial-attacks-in-pytorch-example
View on GitHub
Fast Gradient Sign Method and Iterative Least-Likely Class, using LeNet and DenseNet in PyTorch
☆10Nov 18, 2019Updated 6 years ago