☆23 · Nov 9, 2021 · Updated 4 years ago
Alternatives and similar repositories for OffpolicyAlgorithms
Users that are interested in OffpolicyAlgorithms are comparing it to the libraries listed below.
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper. ☆38 · Oct 14, 2020 · Updated 5 years ago
- ☆27 · Mar 11, 2025 · Updated last year
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture ☆21 · Dec 16, 2018 · Updated 7 years ago
- A library for developing and applying Seldonian algorithms ☆12 · Jan 13, 2024 · Updated 2 years ago
- A tutorial on doing RL research in Julia using both Jupyter notebooks and normal project structures. ☆10 · Jun 23, 2021 · Updated 4 years ago
- C++ Thread Pool implementation based on POSIX pthread ☆14 · Mar 17, 2015 · Updated 11 years ago
- ☆10 · Apr 24, 2021 · Updated 5 years ago
- ☆12 · Jan 31, 2017 · Updated 9 years ago
- Performances of Reinforcement Learning Agents ☆53 · Dec 19, 2019 · Updated 6 years ago
- Glob Include Directive for Jade ☆10 · Dec 20, 2015 · Updated 10 years ago
- Round 1 Starter Kit for the MarLo challenge ☆21 · Sep 27, 2018 · Updated 7 years ago
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo… ☆11 · Dec 30, 2024 · Updated last year
- NumPy+Jax with named axes and an uncompromising attitude ☆23 · Mar 4, 2025 · Updated last year
- Safe Reinforcement Learning with Natural Language Constraints ☆16 · Oct 24, 2021 · Updated 4 years ago
- 3D geoms for plotnine (grammar of graphics in Python) ☆13 · Aug 5, 2022 · Updated 3 years ago
- Binary feature representations with tile coding ☆46 · Sep 14, 2024 · Updated last year
- Procgen2: A community maintained fork of procgen ☆12 · Aug 25, 2022 · Updated 3 years ago
- Tutorials on learning and using successor representations. ☆54 · Oct 31, 2019 · Updated 6 years ago
- [ICRA 2026] UltraDexGrasp: Learning Universal Dexterous Grasping for Bimanual Robots with Synthetic Data ☆78 · Mar 6, 2026 · Updated 3 months ago
- Learning Pytorch ☆13 · Jun 12, 2018 · Updated 7 years ago
- Tail Call Optimizations in Python ☆83 · Dec 15, 2025 · Updated 5 months ago
- Implementation of "Training Agents using Upside-Down Reinforcement Learning (https://arxiv.org/pdf/1912.02877.pdf)" ☆17 · Dec 17, 2019 · Updated 6 years ago
- ☆38 · Nov 15, 2025 · Updated 6 months ago
- OpenAI Gym Wrapper for DeepMind Control Suite ☆74 · Nov 30, 2021 · Updated 4 years ago
- Dynamic channel allocation in cellular networks by reinforcement learning ☆18 · May 25, 2022 · Updated 4 years ago
- Code-base for the paper Spectral Normalisation for Deep Reinforcement Learning: An Optimisation Perspective. ☆11 · Jun 26, 2021 · Updated 4 years ago
- Experiment utility code, specifically designed for use with Compute Canada. ☆11 · Jan 27, 2025 · Updated last year
- ☆29 · Apr 11, 2026 · Updated 2 months ago
- my take at a PDF text extraction utility ☆15 · Jun 15, 2015 · Updated 10 years ago
- An implementation of Deepmind's MuZero algorithm. ☆16 · Aug 23, 2021 · Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆25 · Jul 18, 2023 · Updated 2 years ago
- Logarithmic Reinforcement Learning ☆28 · Apr 7, 2023 · Updated 3 years ago
- Chord Implementation for Distributed Systems Course ☆23 · Nov 30, 2011 · Updated 14 years ago
- Higher Order SVD implementation in PyTorch ☆13 · Nov 14, 2022 · Updated 3 years ago
- Code for Abstract-to-Executable Trajectory Translation for One Shot Task Generalization (ICML 2023) ☆23 · May 12, 2023 · Updated 3 years ago
- A D3 plugin to draw contour plots of 2D functions. ☆19 · Sep 26, 2024 · Updated last year
- A star for organising blocks and playing with transformers. ☆23 · Apr 28, 2024 · Updated 2 years ago
- A framework for experimenting with never-ending learning ☆81 · Oct 16, 2024 · Updated last year
- Implementation of the Monte-Carlo CTW AIXI approximation as described by Joel Veness et al. ☆12 · Jan 14, 2017 · Updated 9 years ago