AlphaZero for continuous control tasks
☆23Dec 7, 2022Updated 3 years ago
Alternatives and similar repositories for alphazero-gym
Users that are interested in alphazero-gym are comparing it to the libraries listed below.
- Applying DeepMind's MuZero algorithm to the cart pole environment in gym☆22May 6, 2023Updated 3 years ago
- Aerial Combat environment built around PyFlyt☆12Aug 12, 2023Updated 2 years ago
- PyTorch implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning https://arxiv.org/ab…☆20Jan 25, 2018Updated 8 years ago
- Gym environment which simulates intraday trading☆28Feb 9, 2022Updated 4 years ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆33Aug 14, 2022Updated 3 years ago
- Clockwork VAEs in JAX/Flax☆32Jul 16, 2021Updated 4 years ago
- List of awesome JAX resources☆13Dec 8, 2022Updated 3 years ago
- Automatic code generator for training Reinforcement Learning policies☆11Jan 3, 2021Updated 5 years ago
- Single player Alpha Zero implementation☆42Mar 7, 2022Updated 4 years ago
- Core interface to design, solve, and simulate trajectory games.☆21Dec 6, 2024Updated last year
- A custom interior point solver for mixed complementarity problems.☆19Apr 20, 2026Updated 2 weeks ago
- ☆19Jan 16, 2025Updated last year
- ☆12Mar 6, 2020Updated 6 years ago
- Annotated implementation of vanilla Transformers to guide through all the ambiguities.☆10Jun 20, 2025Updated 10 months ago
- A python implementation of the COACH algorithm for the Cartpole problem in OpenAI gym.☆11Mar 15, 2019Updated 7 years ago
- Baxter-like robotic arm that can be trained with human hands and a button press.☆17Aug 18, 2022Updated 3 years ago
- Experiments and content for the "Accelerating hyperbolic t-SNE" paper.☆19Apr 30, 2026Updated last week
- A Gym environment for the game Contra, intended for reinforcement learning☆10Oct 18, 2021Updated 4 years ago
- Open DRUWA - Open Deep Realtime User Welcoming Assistant☆16Nov 4, 2022Updated 3 years ago
- If you want an online gym, this is the perfect page. It offers filters and input fields to help you find your perfect routine.☆15Mar 3, 2023Updated 3 years ago
- JAX implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆59Aug 4, 2022Updated 3 years ago
- Linear Regression, Logistic Regression, and MLP Neural Networks in a tiny educational package.☆14Apr 24, 2017Updated 9 years ago
- [ICML 2022] Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum☆11Jul 15, 2022Updated 3 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆48Nov 10, 2020Updated 5 years ago
- ☆12Sep 8, 2022Updated 3 years ago
- Simulated Baxter Robot writing "hello"☆10Jan 15, 2016Updated 10 years ago
- Implicit Distributional Actor Critic☆11Dec 8, 2021Updated 4 years ago
- Official Project Webpage for paper "DiffSRL: Learning Dynamic-aware State Representation for Control via Differentiable Simulation"☆12Apr 4, 2022Updated 4 years ago
- ☆18Jul 10, 2022Updated 3 years ago
- YOLO meets Optical Flow☆14Oct 13, 2022Updated 3 years ago
- ☆22Jan 14, 2020Updated 6 years ago
- Behavioural and Dynamic Learning Network (BunDLe-Net) is an algorithm to learn meaningful coarse-grained representations from time-series…☆16Apr 15, 2025Updated last year
- Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncat…☆11Apr 3, 2019Updated 7 years ago
- Robot payload estimation, based on: C. G. Atkeson, C. H. An, and J. M. Hollerbach, “Estimation of Inertial Parameters of Manipulator Load…☆13Apr 13, 2022Updated 4 years ago
- Object detection using a webcam or mobile camera in the browser. Written in TensorFlow.js☆13Apr 12, 2019Updated 7 years ago
- ☆13Aug 23, 2023Updated 2 years ago
- Python code to teleoperate the Baxter industrial robot using Kinect, Oculus Rift, and a web interface.☆12Jun 30, 2015Updated 10 years ago
- GYM is an easy-to-use gym management and administration system. It helps you to keep track of the records of your members and their membe…☆11May 18, 2025Updated 11 months ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆27May 2, 2025Updated last year