Constrained episodic reinforcement learning in concave-convex and knapsack settings
☆11Oct 3, 2023Updated 2 years ago
Alternatives and similar repositories for ConRL
Users that are interested in ConRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Reinforcement Learning with Convex Constraints☆14Apr 6, 2022Updated 3 years ago
- Active Imitation Learing with Noisy Guidance☆10May 29, 2020Updated 5 years ago
- Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)☆11May 19, 2021Updated 4 years ago
- Three Agent-Based Simulation for Edge Computing in 5G and Beyond for the recent paper titled "Design and Simulation of a Hybrid Architect…☆20Oct 26, 2021Updated 4 years ago
- A set of algorithms and environments to train SafeRL agents, written in TensorFlow2 and OpenAI Gym.☆12Jul 26, 2022Updated 3 years ago
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 2 years ago
- This repository contains the code for RL for POMDPs through learning an Approximate Information State.☆22Nov 29, 2025Updated 3 months ago
- FPsolve: solver for polynomial equations over omega-continuous semirings☆11Aug 15, 2015Updated 10 years ago
- Disagreement-Regularized Imitation Learning☆30May 25, 2021Updated 4 years ago
- My Simulation platform for edge computing based on Veins which includes SUMO and OMNet++☆15May 17, 2020Updated 5 years ago
- Build-to-Order BLAS☆12Apr 9, 2019Updated 6 years ago
- A DP beam-search extension of Mitchell Stern's span-based neural constituency parser☆11Aug 24, 2022Updated 3 years ago
- Approximate and vectorized versions of common mathematical functions☆13Mar 1, 2017Updated 9 years ago
- ☆14Feb 1, 2024Updated 2 years ago
- SODAR Core: A Django-based framework for building scientific data management web apps☆13Updated this week
- COOM: Benchmarking Continual Reinforcement Learning on Doom☆22Mar 5, 2026Updated 2 weeks ago
- Validation for an online resource allocation algorithm for mobile edge computing☆20Dec 5, 2016Updated 9 years ago
- ☆13May 30, 2019Updated 6 years ago
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Mar 1, 2023Updated 3 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- A helper package to get information of scholarly articles from DBLP using its public API☆15May 13, 2025Updated 10 months ago
- Chu-Lui-Edmonds decoding extracted from TurboParser☆14May 16, 2017Updated 8 years ago
- LaTeX Style for a thesis or dissertation at Case Western Reserve University☆13Feb 6, 2015Updated 11 years ago
- ☆18Oct 20, 2017Updated 8 years ago
- Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick☆17Jun 14, 2017Updated 8 years ago
- Non-stationary Off-policy Evaluation☆13Nov 8, 2018Updated 7 years ago
- Scientific Computing in Python, a practical and ultimate tutorials☆14Mar 7, 2023Updated 3 years ago
- Probabilistic planning in continuous state-action MDPs in TensorFlow.☆13Jun 21, 2022Updated 3 years ago
- Exact and heuristic methods for tearing☆13Sep 2, 2023Updated 2 years ago
- My implementation (PyTorch) for the paper SST: Single-Stream Temporal Action Proposals (http://vision.stanford.edu/pdf/buch2017cvpr.pdf).☆10Dec 8, 2022Updated 3 years ago
- A Convolutional Neural Network Cascade for Face Detection☆14May 29, 2016Updated 9 years ago
- A tour of Pomdpland☆10Aug 10, 2022Updated 3 years ago
- An interactive tool for analyzing, executing, and improving dynamic programming algorithms.☆20Jan 30, 2026Updated last month
- COSMIS is a framework for quantifying the mutational constraint on amino acid sites in 3D spatial neighborhoods. The framework currently …☆16Nov 11, 2022Updated 3 years ago
- Optimally-weighted herding is Bayesian Quadrature☆16Jul 8, 2016Updated 9 years ago
- A solver that attempts to compute the best possible move sequence for a given Candy Crush Saga board.☆19Sep 13, 2013Updated 12 years ago
- Dyna built on R-exprs (First Prototype)☆17Mar 7, 2022Updated 4 years ago
- The interface between probabilistic model checking and data-driven policy learning.☆16Mar 11, 2026Updated last week
- Convergent Policy Optimization for Safe Reinforcement Learning☆11Oct 26, 2019Updated 6 years ago