Reinforcement Learning with Convex Constraints
☆14Apr 6, 2022Updated 4 years ago
Alternatives and similar repositories for ApproPO
Users that are interested in ApproPO are comparing it to the libraries listed below.
- Constrained episodic reinforcement learning in concave-convex and knapsack settings☆11Oct 3, 2023Updated 2 years ago
- Active Imitation Learning with Noisy Guidance☆10May 29, 2020Updated 5 years ago
- Disagreement-Regularized Imitation Learning☆30May 25, 2021Updated 4 years ago
- Examples for comparing and merging versions of Jupyter notebooks☆13Sep 5, 2023Updated 2 years ago
- Code for "Learning Local Control Barrier Functions for Safety Control of Hybrid Systems"☆14Jan 29, 2024Updated 2 years ago
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 3 years ago
- Trigonometric functions for fixed-point numbers☆19Sep 10, 2022Updated 3 years ago
- A simple bus simulation environment for bus bunching analysis in New York City☆14Oct 7, 2018Updated 7 years ago
- ☆15Oct 16, 2020Updated 5 years ago
- Methods for Non-Smooth Convex Optimization (NSO), written in Python☆29Feb 6, 2024Updated 2 years ago
- Safe Reinforcement Learning in Constrained Markov Decision Processes☆61Jul 21, 2020Updated 5 years ago
- Implementation of Grid-Forming HAC for Converter Connected to an Infinite Bus☆17Feb 10, 2021Updated 5 years ago
- Build-to-Order BLAS☆12Apr 9, 2019Updated 7 years ago
- A DP beam-search extension of Mitchell Stern's span-based neural constituency parser☆11Aug 24, 2022Updated 3 years ago
- ☆14Feb 1, 2024Updated 2 years ago
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Mar 1, 2023Updated 3 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- Target Agnostic Attack on Deep Models: Exploiting Security Vulnerabilities of Transfer Learning☆10Jul 2, 2019Updated 6 years ago
- Code for human intervention reinforcement learning☆35Jan 8, 2018Updated 8 years ago
- Chu-Liu-Edmonds decoding extracted from TurboParser☆14May 16, 2017Updated 8 years ago
- A helper package to get information of scholarly articles from DBLP using its public API☆16May 13, 2025Updated 11 months ago
- Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick☆17Jun 14, 2017Updated 8 years ago
- Probabilistic planning in continuous state-action MDPs in TensorFlow.☆13Jun 21, 2022Updated 3 years ago
- Exact and heuristic methods for tearing☆13Sep 2, 2023Updated 2 years ago
- mhar☆22Feb 15, 2024Updated 2 years ago
- A collection of useful, free, single-file libraries for C.☆11Oct 15, 2015Updated 10 years ago
- A tour of Pomdpland☆10Aug 10, 2022Updated 3 years ago
- An interactive tool for analyzing, executing, and improving dynamic programming algorithms.☆22Jan 30, 2026Updated 2 months ago
- Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)☆11May 19, 2021Updated 4 years ago
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"☆11Oct 2, 2018Updated 7 years ago
- Optimally-weighted herding is Bayesian Quadrature☆16Jul 8, 2016Updated 9 years ago
- [TOIS'24] "RecRanker: Instruction Tuning Large Language Model as Ranker for Top-k Recommendation"☆16Dec 1, 2024Updated last year
- Dyna built on R-exprs (First Prototype)☆17Mar 7, 2022Updated 4 years ago
- The interface between probabilistic model checking and data-driven policy learning.☆18Apr 3, 2026Updated last week
- OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.☆62Aug 9, 2022Updated 3 years ago
- PyTorch implementation of various reinforcement learning algorithms☆18Feb 22, 2018Updated 8 years ago
- Convergent Policy Optimization for Safe Reinforcement Learning☆11Oct 26, 2019Updated 6 years ago
- Learning to Prune: Exploring the Frontier of Fast and Accurate Parsing☆22Sep 24, 2024Updated last year
- a sample code for utilizing torch.distributed☆21Aug 25, 2020Updated 5 years ago