Reinforcement Learning with Convex Constraints
☆14 Apr 6, 2022 Updated 3 years ago
Alternatives and similar repositories for ApproPO
Users who are interested in ApproPO compare it to the libraries listed below.
- Constrained episodic reinforcement learning in concave-convex and knapsack settings ☆11 Oct 3, 2023 Updated 2 years ago
- Safe Reinforcement Learning in Constrained Markov Decision Processes ☆61 Jul 21, 2020 Updated 5 years ago
- Disagreement-Regularized Imitation Learning ☆30 May 25, 2021 Updated 4 years ago
- Methods for Non-Smooth Convex Optimization (NSO), written in Python ☆29 Feb 6, 2024 Updated 2 years ago
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems" ☆34 Dec 9, 2022 Updated 3 years ago
- Code for human intervention reinforcement learning ☆35 Jan 8, 2018 Updated 8 years ago
- A collection of useful, free, single-file libraries for C. ☆11 Oct 15, 2015 Updated 10 years ago
- ☆10 Oct 3, 2023 Updated 2 years ago
- ☆11 Sep 22, 2019 Updated 6 years ago
- ☆11 Mar 13, 2023 Updated 2 years ago
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment ☆16 Aug 6, 2024 Updated last year
- Code and data for EMNLP2019 Paper "Uncover the Ground-Truth Relations in Distant Supervision: A Neural Expectation-Maximization Framework… ☆10 May 24, 2020 Updated 5 years ago
- ☆11 Feb 23, 2026 Updated last week
- FPsolve: solver for polynomial equations over omega-continuous semirings ☆11 Aug 15, 2015 Updated 10 years ago
- SentiStorm - Real-time Twitter Sentiment Classification based on Apache Storm ☆10 May 22, 2018 Updated 7 years ago
- A helper package to get information of scholarly articles from DBLP using its public API ☆15 May 13, 2025 Updated 9 months ago
- Evaluating variational inference using Pareto-smoothed importance sampling and simulation-based calibration ☆12 Jun 8, 2018 Updated 7 years ago
- Density Constrained Reinforcement Learning ☆12 Mar 24, 2023 Updated 2 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards. ☆10 Nov 13, 2017 Updated 8 years ago
- ☆14 Sep 30, 2022 Updated 3 years ago
- Lecture notes for CSC2421 ☆10 Jan 8, 2023 Updated 3 years ago
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models ☆10 Oct 27, 2023 Updated 2 years ago
- Implementation of the X-armed Bandits algorithm, as detailed in the paper, "X-armed Bandits", Bubeck et al., 2011. ☆11 Jul 12, 2018 Updated 7 years ago
- A simple bus simulation environment for bus bunching analysis in New York City ☆13 Oct 7, 2018 Updated 7 years ago
- Package to evaluate network-propagation-based similarity measures between nodes in a networkX graph ☆10 Jul 31, 2018 Updated 7 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees ☆55 Jul 26, 2019 Updated 6 years ago
- ☆10 Jun 28, 2015 Updated 10 years ago
- ***Warning*** Old Apache Flink Graph API: This repository is not in use anymore. ☆15 Jan 18, 2016 Updated 10 years ago
- Probabilistic planning in continuous state-action MDPs in TensorFlow. ☆13 Jun 21, 2022 Updated 3 years ago
- ☆14 Feb 1, 2024 Updated 2 years ago
- A tour of Pomdpland ☆10 Aug 10, 2022 Updated 3 years ago
- A DP beam-search extension of Mitchell Stern's span-based neural constituency parser ☆11 Aug 24, 2022 Updated 3 years ago
- hanabi_learning_environment is a research platform for Hanabi experiments. ☆11 May 17, 2022 Updated 3 years ago
- ☆11 Oct 19, 2020 Updated 5 years ago
- Python 3.6 and TensorFlow implementation of the AReS and MaRS algorithms ☆11 Jun 23, 2019 Updated 6 years ago
- Experiments studying ensemble methods for stock portfolio selection ☆15 Oct 4, 2017 Updated 8 years ago
- Contextual bandit benchmarking ☆53 Jan 21, 2026 Updated last month
- This is all the codes used in "Large Scale Online Kernel Learning" ☆11 Aug 8, 2017 Updated 8 years ago
- ☆12 Aug 13, 2022 Updated 3 years ago