AlphaZero for continuous control tasks
☆23Dec 7, 2022Updated 3 years ago
Alternatives and similar repositories for alphazero-gym
Users that are interested in alphazero-gym are comparing it to the libraries listed below.
- Applying DeepMind's MuZero algorithm to the cart pole environment in gym☆22May 6, 2023Updated 3 years ago
- Aerial Combat environment built around PyFlyt☆12Aug 12, 2023Updated 2 years ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆33Aug 14, 2022Updated 3 years ago
- Clockwork VAEs in JAX/Flax☆32Jul 16, 2021Updated 4 years ago
- A graph-based streamflow modelling system in Julialang☆14Apr 2, 2026Updated last month
- List of awesome JAX resources☆13Dec 8, 2022Updated 3 years ago
- Automatic code generator for training Reinforcement Learning policies☆11Jan 3, 2021Updated 5 years ago
- a little library to help me with things involving Koopman operators☆12Mar 3, 2022Updated 4 years ago
- Graduate topics course on statistics and the cosmic microwave background☆14Jun 23, 2016Updated 9 years ago
- Richard @rdgao & Michael @michaeldeistler: using neural network-based regression and density estimation for Generalized Bayesian Inferenc…☆11Dec 19, 2025Updated 5 months ago
- Core interface to design, solve, and simulate trajectory games.☆21Dec 6, 2024Updated last year
- Original SACSMA-Snow17 Fortran Code☆15Jul 22, 2020Updated 5 years ago
- A custom interior point solver for mixed complementarity problems.☆19Apr 20, 2026Updated last month
- ☆12Mar 6, 2020Updated 6 years ago
- A neural network accelerated solver for mixed-strategy solutions of trajectory games. Do you even lift?☆18Jun 22, 2025Updated 11 months ago
- Code implementation of SIFT, plus k-means visual words☆11Oct 18, 2020Updated 5 years ago
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"☆12Jul 12, 2021Updated 4 years ago
- Evolutionary Algorithms code examples. UC Davis ECI 263.☆16Oct 18, 2018Updated 7 years ago
- Annotated implementation of vanilla Transformers to guide through all the ambiguities.☆10Jun 20, 2025Updated 11 months ago
- Baxter-like robotic arm that can be trained with human hands and a button press.☆17Aug 18, 2022Updated 3 years ago
- Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs☆15Jun 20, 2016Updated 9 years ago
- ☆13Jul 13, 2022Updated 3 years ago
- Open DRUWA - Open Deep Realtime User Welcoming Assistant☆16Nov 4, 2022Updated 3 years ago
- If you want an online gym, this is the perfect page. You have some filters and input fields to find your perfect routine.☆15Mar 3, 2023Updated 3 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆60Aug 4, 2022Updated 3 years ago
- C++ implementation of multi-layer feed forward neural networks with back propagation algorithm.☆10Mar 30, 2016Updated 10 years ago
- Implemented YOLOv2 with Tensorflow 2.0☆10Oct 6, 2022Updated 3 years ago
- [ICML 2022] Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum☆11Jul 15, 2022Updated 3 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆48Nov 10, 2020Updated 5 years ago
- Simulated Baxter Robot writing "hello"☆10Jan 15, 2016Updated 10 years ago
- Assorted notebooks for my Scientific work☆22Apr 23, 2026Updated last month
- Implicit Distributional Actor Critic☆11Dec 8, 2021Updated 4 years ago
- Official Project Webpage for paper "DiffSRL: Learning Dynamic-aware State Representation for Control via Differentiable Simulation"☆12Apr 4, 2022Updated 4 years ago
- ☆18Jul 10, 2022Updated 3 years ago
- YOLO meets Optical Flow☆14Oct 13, 2022Updated 3 years ago
- Behavioural and Dynamic Learning Network (BunDLe-Net) is an algorithm to learn meaningful coarse-grained representations from time-series…☆16Apr 15, 2025Updated last year
- This repository provides a GitHub Action for running the Kani Rust Verifier in CI.☆13May 13, 2025Updated last year
- Robot payload estimation, based on: C. G. Atkeson, C. H. An, and J. M. Hollerbach, “Estimation of Inertial Parameters of Manipulator Load…☆13Apr 13, 2022Updated 4 years ago
- Simple ML Algorithm to detect licence plates☆13Nov 22, 2022Updated 3 years ago