Applying DeepMind's MuZero algorithm to the cart pole environment in gym
☆22May 6, 2023Updated 3 years ago
Alternatives and similar repositories for muzero-cartpole
Users that are interested in muzero-cartpole are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Study of the paper 'Neural Thompson Sampling' published in October 2020☆24Sep 27, 2022Updated 3 years ago
- ☆55Apr 11, 2023Updated 3 years ago
- A collection of meta-learning algorithms in Jax☆25Sep 3, 2022Updated 3 years ago
- ☆14Mar 5, 2026Updated 2 months ago
- Experiments from our work Uncertainty Quantification and Deep Ensemble☆10Nov 1, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Simple Goal Oriented Action Planning demo written in Javascript with Phaser for studies.☆12Aug 31, 2015Updated 10 years ago
- List of awesome JAX resources☆13Dec 8, 2022Updated 3 years ago
- WebTerm is a Terminal emulator that runs in the browser. It uses v86 to create a virtual linux via WebAssembly and xterm.js as the termin…☆17Apr 28, 2021Updated 5 years ago
- Reference implementation for the paper titled "Improving Model-Based Reinforcement Learning with Internal State Representations through S…☆12Feb 10, 2021Updated 5 years ago
- ☆15Mar 30, 2024Updated 2 years ago
- Single player Alpha Zero implementation☆42Mar 7, 2022Updated 4 years ago
- ☆11May 15, 2020Updated 5 years ago
- Example of binding a TF32 CUTLASS GEMM kernel to PyTorch☆12Jun 7, 2024Updated last year
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- AlphaZero in JAX☆83Apr 3, 2024Updated 2 years ago
- Udacity Deep Reinforcement Learning Nanodegree Program☆11Jul 12, 2019Updated 6 years ago
- ☆19Jan 16, 2025Updated last year
- Soccer toy example simulator used in Reinforcement Learning☆12Mar 11, 2018Updated 8 years ago
- A PyTorch implementation of PTSA-MCTS from [Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction].☆16Oct 21, 2023Updated 2 years ago
- ☆18Dec 29, 2018Updated 7 years ago
- Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs☆15Jun 20, 2016Updated 9 years ago
- ROS Driver for PI-Hexapods☆15Aug 11, 2020Updated 5 years ago
- Analysis of the MovieLens dataset of movie ratings and reviews.☆11Sep 2, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Procgen2: A community maintained fork of procgen☆12Aug 25, 2022Updated 3 years ago
- Deep Reinforcement Learning for Multi Agent Soccer☆16Dec 15, 2016Updated 9 years ago
- [ICML 2022] Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum☆11Jul 15, 2022Updated 3 years ago
- A set of solutions to ETHZ ROS lectures☆13Jul 19, 2017Updated 8 years ago
- Implicit Distributional Actor Critic☆11Dec 8, 2021Updated 4 years ago
- Accompanying code for the paper "Conditional Unscented Autoencoders for Trajectory Prediction"☆16Sep 6, 2024Updated last year
- A JAX implementation of stochastic addition.☆14Aug 15, 2022Updated 3 years ago
- ☆13Mar 11, 2018Updated 8 years ago
- Classic MCTS example with mctx☆25May 25, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Scripts for Lectures on Network Systems - Francesco Bullo☆16Oct 22, 2023Updated 2 years ago
- RecyclerView adapter for sectioned item like Swift table view☆16Oct 25, 2017Updated 8 years ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆27May 2, 2025Updated last year
- An implementation of Color2Gray with convolutional neural networks☆11Dec 23, 2015Updated 10 years ago
- Simple, extensible implementations of some meta-learning algorithms in Jax☆11Oct 6, 2020Updated 5 years ago
- Paper introducing jax-cosmo☆13Apr 27, 2023Updated 3 years ago
- This repository contains Java code for implementing a ONE Record compliant API.☆19May 17, 2024Updated last year