☆23Nov 9, 2021Updated 4 years ago
Alternatives and similar repositories for OffpolicyAlgorithms
Users that are interested in OffpolicyAlgorithms are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.☆38Oct 14, 2020Updated 5 years ago
- ☆27Mar 11, 2025Updated last year
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Jun 22, 2021Updated 5 years ago
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture☆21Dec 16, 2018Updated 7 years ago
- A library for developing and applying Seldonian algorithms☆12Jan 13, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A tutorial on doing RL research in Julia using both Jupyter notebooks and normal project structures.☆10Jun 23, 2021Updated 5 years ago
- C++ Thread Pool implementation base on POSIX pthread☆14Mar 17, 2015Updated 11 years ago
- 🪗 Dynamic concurrency limits for controlling backpressure, inspired by TCP congestion control☆16May 4, 2026Updated last month
- My awesome i3 configuration☆17Aug 7, 2022Updated 3 years ago
- An opensource implementation of kanerva coding for use in reinforcement learning research☆11Mar 28, 2026Updated 3 months ago
- A CLI app for taking simple notes without ever leaving the terminal.☆12Jan 7, 2019Updated 7 years ago
- ☆10Apr 24, 2021Updated 5 years ago
- ☆12Jan 31, 2017Updated 9 years ago
- Performances of Reinforcement Learning Agents☆53Dec 19, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Glob Include Directive for Jade☆10Dec 20, 2015Updated 10 years ago
- A collection of notebooks aiding the understanding of machine-learning papers.☆10Apr 5, 2021Updated 5 years ago
- Round 1 Starter Kit for the MarLo challenge☆21Sep 27, 2018Updated 7 years ago
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…☆11Dec 30, 2024Updated last year
- NumPy+Jax with named axes and an uncompromising attitude☆23Mar 4, 2025Updated last year
- 3D geoms for plotnine (grammar of graphics in Python)☆13Aug 5, 2022Updated 3 years ago
- Binary feature representations with tile coding☆46Sep 14, 2024Updated last year
- Procgen2: A community maintained fork of procgen☆12Aug 25, 2022Updated 3 years ago
- Tutorials on learning and using successor representations.☆54Oct 31, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Unofficial Experiments with AlgebraNets☆17Jun 17, 2020Updated 6 years ago
- [ICRA 2026] UltraDexGrasp: Learning Universal Dexterous Grasping for Bimanual Robots with Synthetic Data☆79Mar 6, 2026Updated 3 months ago
- Learning Pytorch☆13Jun 12, 2018Updated 8 years ago
- Implementation of "Training Agents using Upside-Down Reinforcement Learning (https://arxiv.org/pdf/1912.02877.pdf)"☆17Dec 17, 2019Updated 6 years ago
- OpenAI Gym Wrapper for DeepMind Control Suite☆74Nov 30, 2021Updated 4 years ago
- Implementation of "Reinforcement Learning in Possibly Nonstationary Environments"☆10Mar 10, 2025Updated last year
- Dynamic channel allocation in cellular networks by reinforcement learning☆18May 25, 2022Updated 4 years ago
- Code-base for the paper Spectral Normalisation for Deep Reinforcement Learning: An Optimisation Perspective.☆11Jun 26, 2021Updated 5 years ago
- Experiment utility code, specifically designed for use with Compute Canada.☆11Jan 27, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning…☆79Jul 17, 2023Updated 2 years ago
- ☆29Apr 11, 2026Updated 2 months ago
- my take at a PDF text extraction utility☆15Jun 15, 2015Updated 11 years ago
- An implementation of Deepmind's MuZero algorithm.☆16Aug 23, 2021Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Jul 18, 2023Updated 2 years ago
- Posterior with interesting shapes from actually used models☆13Feb 10, 2025Updated last year
- Logarithmic Reinforcement Learning☆28Apr 7, 2023Updated 3 years ago