AlphaZero for continuous control tasks
☆23Dec 7, 2022Updated 3 years ago
Alternatives and similar repositories for alphazero-gym
Users that are interested in alphazero-gym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [SIGGRAPH Asia 2025] "ASIA: Adaptive 3D Segmentation using Few Image Annotations ".☆26Feb 14, 2026Updated 2 months ago
- Gym environment which simulates intraday trading☆28Feb 9, 2022Updated 4 years ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆33Aug 14, 2022Updated 3 years ago
- Clockwork VAEs in JAX/Flax☆32Jul 16, 2021Updated 4 years ago
- List of awesome JAX resources☆13Dec 8, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Automatic code generator for training Reinforcement Learning policies☆11Jan 3, 2021Updated 5 years ago
- a little library to help me with things involving Koopman operators☆12Mar 3, 2022Updated 4 years ago
- [TVCG 2021] Consistent Two-Flow Network for Tele-Registration of Point Clouds☆11Aug 9, 2021Updated 4 years ago
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 3 years ago
- Core interface to design, solve, and simulate trajectory games.☆21Dec 6, 2024Updated last year
- ☆19Jan 16, 2025Updated last year
- ☆12Mar 6, 2020Updated 6 years ago
- A neural network accelerated solver for mixed-strategy solutions of trajectory games. Do you even lift?☆18Jun 22, 2025Updated 9 months ago
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"☆12Jul 12, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A PyTorch implementation of PTSA-MCTS from [Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction].☆16Oct 21, 2023Updated 2 years ago
- Annotated implementation of vanilla Transformers to guide through all the ambiguities.☆10Jun 20, 2025Updated 9 months ago
- Using self-play, MCTS, and a deep neural network to create a hearthstone ai player☆29Nov 1, 2018Updated 7 years ago
- This is a Python project implementing the Hidden Points Removal operator on a point cloud seen from a chosen point of view☆10Nov 13, 2015Updated 10 years ago
- A python implementation of the COACH algorithm for the Cartpole problem in OpenAI gym.☆11Mar 15, 2019Updated 7 years ago
- ☆13Sep 23, 2021Updated 4 years ago
- Baxter-like robotic arm that can be trained with human hands and a button press.☆17Aug 18, 2022Updated 3 years ago
- Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs☆15Jun 20, 2016Updated 9 years ago
- Experiments and content for the "Accelerating hyperbolic t-SNE" paper.☆18Aug 29, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A gym game for Contra that for reinforcement learning☆10Oct 18, 2021Updated 4 years ago
- If you want a online gym, this is the perfect page. You have some filters and inputs fields in order to find your perfect routine.☆15Mar 3, 2023Updated 3 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆59Aug 4, 2022Updated 3 years ago
- Implemented YOLOv2 with Tensorflow 2.0☆10Oct 6, 2022Updated 3 years ago
- [ICML 2022] Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum☆11Jul 15, 2022Updated 3 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆48Nov 10, 2020Updated 5 years ago
- Simulated Baxter Robot writing "hello"☆10Jan 15, 2016Updated 10 years ago
- Implicit Distributional Actor Critic☆11Dec 8, 2021Updated 4 years ago
- Official Project Webpage for paper "DiffSRL: Learning Dynamic-aware State Representation for Control via Differentiable Simulation"☆12Apr 4, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆18Jul 10, 2022Updated 3 years ago
- YOLO meets Optical Flow☆14Oct 13, 2022Updated 3 years ago
- Behavioural and Dynamic Learning Network (BunDLe-Net) is an algorithm to learn meaningful coarse-grained representations from time-series…☆15Apr 15, 2025Updated last year
- This repository provides a GitHub Action for running the Kani Rust Verifier in CI.☆12May 13, 2025Updated 11 months ago
- Classic MCTS example with mctx☆24May 25, 2023Updated 2 years ago
- Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncat…☆11Apr 3, 2019Updated 7 years ago
- Robot payload estimation, based on: C. G. Atkeson, C. H. An, and J. M. Hollerbach, “Estimation of Inertial Parameters of Manipulator Load…☆13Apr 13, 2022Updated 4 years ago