Applying DeepMind's MuZero algorithm to the cart pole environment in gym
☆22May 6, 2023Updated 3 years ago
Alternatives and similar repositories for muzero-cartpole
Users that are interested in muzero-cartpole are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Study of the paper 'Neural Thompson Sampling' published in October 2020☆25Sep 27, 2022Updated 3 years ago
- AlphaZero for continuous control tasks☆23Dec 7, 2022Updated 3 years ago
- ☆55Apr 11, 2023Updated 3 years ago
- A collection of meta-learning algorithms in Jax☆24Sep 3, 2022Updated 3 years ago
- ☆14Mar 5, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Experiments from our work Uncertainty Quantification and Deep Ensemble☆10Nov 1, 2021Updated 4 years ago
- Simple Goal Oriented Action Planning demo written in Javascript with Phaser for studies.☆12Aug 31, 2015Updated 10 years ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆33Aug 14, 2022Updated 3 years ago
- List of awesome JAX resources☆13Dec 8, 2022Updated 3 years ago
- A multi-agent soccer simulator in a grid-world environment, with agents implementing different reinforcement learning algorithms☆13Jun 4, 2017Updated 8 years ago
- Reference implementation for the paper titled "Improving Model-Based Reinforcement Learning with Internal State Representations through S…☆12Feb 10, 2021Updated 5 years ago
- Single player Alpha Zero implementation☆42Mar 7, 2022Updated 4 years ago
- A PyTorch implementation of DeepMind's MuZero agent☆37Dec 1, 2023Updated 2 years ago
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- AlphaZero in JAX☆82Apr 3, 2024Updated 2 years ago
- General purpose, statically typed, functional programming language☆14Dec 6, 2025Updated 5 months ago
- A pathway and collection of resources to learning Jax from beginning to advance.☆11Jan 2, 2021Updated 5 years ago
- ☆19Jan 16, 2025Updated last year
- An interactive simulation to explain algorithmic bias.☆13Dec 3, 2022Updated 3 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Dec 21, 2021Updated 4 years ago
- A PyTorch implementation of PTSA-MCTS from [Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction].☆16Oct 21, 2023Updated 2 years ago
- Deep PILCO PyTorch Implementation☆15Mar 25, 2023Updated 3 years ago
- ☆19Dec 29, 2018Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ROS Driver for PI-Hexapods☆15Aug 11, 2020Updated 5 years ago
- Procgen2: A community maintained fork of procgen☆12Aug 25, 2022Updated 3 years ago
- A set of solutions to ETHZ ROS lectures☆13Jul 19, 2017Updated 8 years ago
- Implicit Distributional Actor Critic☆11Dec 8, 2021Updated 4 years ago
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation☆12May 24, 2021Updated 5 years ago
- A JAX implementation of stochastic addition.☆14Aug 15, 2022Updated 3 years ago
- Scripts for Lectures on Network Systems - Francesco Bullo☆16Oct 22, 2023Updated 2 years ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆27May 2, 2025Updated last year
- An implementation of Color2Gray with convolutional neural networks☆11Dec 23, 2015Updated 10 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Simple, extensible implementations of some meta-learning algorithms in Jax☆11Oct 6, 2020Updated 5 years ago
- Paper introducing jax-cosmo☆13Apr 27, 2023Updated 3 years ago
- This repository contains Java code for implementing a ONE Record compliant API.☆19May 17, 2024Updated 2 years ago
- Code for the paper Adversarial Robustness via Adversarial Label-Smoothing☆11Feb 5, 2020Updated 6 years ago
- Python tools for solving data-constrained finite element problems☆13Nov 9, 2021Updated 4 years ago
- 2024CCF国际AIOps挑战赛-赛道二(GLM4):基于检索增强的运维知识问答挑战赛解决方案分享。☆14Jul 5, 2024Updated last year
- ☆14Oct 27, 2019Updated 6 years ago