xkianteb/ApproPO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xkianteb/ApproPO)

xkianteb / ApproPO

Reinforcement Learning with Convex Constraints

☆14

Alternatives and similar repositories for ApproPO

Users that are interested in ApproPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

miryoosefi / ConRL
View on GitHub
Constrained episodic reinforcement learning in concave-convex and knapsack settings
☆11Oct 3, 2023Updated 2 years ago
xkianteb / leaqi
View on GitHub
Active Imitation Learing with Noisy Guidance
☆10May 29, 2020Updated 6 years ago
xkianteb / dril
View on GitHub
Disagreement-Regularized Imitation Learning
☆30May 25, 2021Updated 5 years ago
ywnch / buskit
View on GitHub
A simple bus simulation environment for bus bunching analysis in New York City
☆14Oct 7, 2018Updated 7 years ago
brian-rose / notebook_diff_tutorial
View on GitHub
Examples for comparing and merging versions of Jupyter notebooks
☆13Sep 5, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
shuoyang2000 / neural_hybrid_cbf
View on GitHub
Code for "Learning Local Control Barrier Functions for Safety Control of Hybrid Systems"
☆14Jan 29, 2024Updated 2 years ago
MIT-REALM / dcrl
View on GitHub
Density Constrained Reinforcement Learning
☆12Mar 24, 2023Updated 3 years ago
robin-vjc / nsopy
View on GitHub
Methods for Non-Smooth Convex Optimization (NSO), written in Python
☆30Feb 6, 2024Updated 2 years ago
uumami / mhar
View on GitHub
mhar
☆22Feb 15, 2024Updated 2 years ago
ATayebi / HybridAngleControl-HAC
View on GitHub
Implementation of Grid-Forming HAC for Converter Connected to an Infinite Bus
☆17Feb 10, 2021Updated 5 years ago
pmineiro / fastapprox
View on GitHub
Approximate and vectorized versions of common mathematical functions
☆13Mar 1, 2017Updated 9 years ago
junekihong / beam-span-parser
View on GitHub
A DP beam-search extension of Mitchell Stern's span-based neural constituency parser
☆11Aug 24, 2022Updated 3 years ago
abbyvansoest / maxent
View on GitHub
☆14May 30, 2019Updated 7 years ago
AdityaMate / collapsing_bandits
View on GitHub
Code repo for "Collapsing Bandits and Their Applications to Public Health Interventions", (NeurIPS'20)
☆11Dec 3, 2025Updated 7 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
zackchase / intrinsic-fear-dqn
View on GitHub
Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.
☆10Nov 13, 2017Updated 8 years ago
VersBinarii / pomia-rs
View on GitHub
STM32 + Rust + RTIC embedded project
☆21Jan 5, 2021Updated 5 years ago
andersjo / dependency_decoding
View on GitHub
Chu-Lui-Edmonds decoding extracted from TurboParser
☆14May 16, 2017Updated 9 years ago
KAIST-AILab / gmmil
View on GitHub
Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"
☆11Oct 2, 2018Updated 7 years ago
rjagerman / wsdm2019-nonstationary
View on GitHub
Non-stationary Off-policy Evaluation
☆13Nov 8, 2018Updated 7 years ago
thiagopbueno / tf-mdp
View on GitHub
Probabilistic planning in continuous state-action MDPs in TensorFlow.
☆13Jun 21, 2022Updated 4 years ago
matejbalog / gumbel-relatives
View on GitHub
Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick
☆17Jun 14, 2017Updated 9 years ago
gsastry / human-rl
View on GitHub
Code for human intervention reinforcement learning
☆35Jan 8, 2018Updated 8 years ago
baharev / sdopt-tearing
View on GitHub
Exact and heuristic methods for tearing
☆14Sep 2, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ievron / RegularizationAnimation
View on GitHub
☆11Dec 27, 2021Updated 4 years ago
txzhao / rl-zoo
View on GitHub
PyTorch implementation of various reinforcement learning algorithms
☆18Feb 22, 2018Updated 8 years ago
duvenaud / herding-paper
View on GitHub
Optimally-weighted herding is Bayesian Quadrature
☆17Jul 8, 2016Updated 10 years ago
ming93 / Safe_reinforcement_learning
View on GitHub
Convergent Policy Optimization for Safe Reinforcement Learning
☆11Oct 26, 2019Updated 6 years ago
keskarnitish / NQN
View on GitHub
A Limited-Memory Quasi-Newton Algorithm for Bound-Constrained Nonsmooth Optimization
☆13Dec 23, 2016Updated 9 years ago
oval-group / dfw
View on GitHub
Implementation of the Deep Frank-Wolfe Algorithm -- Pytorch
☆63Mar 6, 2021Updated 5 years ago
rems75 / SPIBB-DQN
View on GitHub
Code for SPIBB-DQN and Soft-SPIBB-DQN
☆11May 5, 2020Updated 6 years ago
aschein / bptd
View on GitHub
Bayesian Poisson Tucker decomposition
☆17Mar 17, 2017Updated 9 years ago
shuaizhao95 / ICLAttack
View on GitHub
ICL backdoor attack
☆17Nov 4, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
lianghuang3 / lineardpparser
View on GitHub
linear-time dynamic programming dependency parser
☆11Feb 2, 2019Updated 7 years ago
vermouth1992 / safe_rl_papers
View on GitHub
A list of safe reinforcement learning papers
☆21Jan 9, 2020Updated 6 years ago
gabb7 / AReS-MaRS
View on GitHub
Python 3.6 and TensorFlow implementation of the AReS and MaRS algorithms
☆11Jun 23, 2019Updated 7 years ago
Sea-Snell / CALM-Dialogue
View on GitHub
Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"
☆34Dec 9, 2022Updated 3 years ago
acguez / bamcp
View on GitHub
Bayes-Adaptive Monte-Carlo Planning algorithm
☆19Mar 5, 2013Updated 13 years ago
lmzintgraf / MultiMAuS
View on GitHub
Simulator for online credit card transactions with multi-modal authentication
☆21Nov 7, 2017Updated 8 years ago
jiaqima / Off-Policy-2-Stage
View on GitHub
Off-policy Learning in Two-stage Recommender Systems. https://dl.acm.org/doi/pdf/10.1145/3366423.3380130
☆30Jun 11, 2020Updated 6 years ago