☆27Oct 25, 2019Updated 6 years ago
Alternatives and similar repositories for constrained_batch_policy_learning
Users who are interested in constrained_batch_policy_learning are comparing it to the libraries listed below
- [NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare. https://arxiv.org/abs/2305.01738☆11Nov 27, 2022Updated 3 years ago
- Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)☆11May 19, 2021Updated 4 years ago
- ☆10Sep 9, 2022Updated 3 years ago
- ☆15Oct 4, 2022Updated 3 years ago
- Implementing the Learning with Opponent Learning Awareness paper (https://blog.openai.com/learning-to-model-other-minds/)☆19Jul 20, 2018Updated 7 years ago
- Factored model-based Bayesian Reinforcement Learning framework☆22Nov 23, 2022Updated 3 years ago
- Code for the CoRL 2019 paper AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers☆24Feb 15, 2023Updated 3 years ago
- An evolutionary algorithm-based optimization for tracking weights in the OpenSim Residual Reduction Algorithm (RRA).☆11Jul 17, 2023Updated 2 years ago
- A simple Gridworld environment for OpenAI Gym☆25Jun 10, 2018Updated 7 years ago
- Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"☆31Nov 22, 2022Updated 3 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Dec 1, 2019Updated 6 years ago
- ☆24Oct 22, 2015Updated 10 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- ☆31Nov 21, 2018Updated 7 years ago
- Author's PyTorch implementation of BCQ for continuous and discrete actions☆657Apr 6, 2021Updated 4 years ago
- workspace comprising demo packages for our roscon2018 talk☆10Dec 21, 2019Updated 6 years ago
- simd enabled column imprints☆11Feb 12, 2018Updated 8 years ago
- Materials for Scala発火村☆30Oct 18, 2010Updated 15 years ago
- [MLHC 2021] Model Selection for Offline RL: Practical Considerations for Healthcare Settings. https://arxiv.org/abs/2107.11003☆10Oct 6, 2022Updated 3 years ago
- ☆11May 13, 2021Updated 4 years ago
- ☆16Dec 6, 2014Updated 11 years ago
- The SOLAR blackbox optimization problem☆16Sep 24, 2025Updated 5 months ago
- SymPy based framework for optimized code generation for BSSN formulation of Einstein equation for heterogeneous platforms.☆11Aug 18, 2025Updated 6 months ago
- Autonomous Car System Integration Project on Udacity SDC ND☆11Dec 5, 2017Updated 8 years ago
- ☆13May 30, 2019Updated 6 years ago
- Code for CoRL 2022 paper: https://arxiv.org/abs/2211.09006 (simulation environments)☆11Feb 9, 2023Updated 3 years ago
- ☆10Apr 13, 2020Updated 5 years ago
- ☆12Jun 25, 2021Updated 4 years ago
- A helper package to get information of scholarly articles from DBLP using its public API☆15May 13, 2025Updated 9 months ago
- IOManager tries to bridge the gap in existing async framework to build full async networked database/storage/keyvalue storage☆11Feb 7, 2026Updated 3 weeks ago
- ros2 differential drive robot☆10Jan 14, 2021Updated 5 years ago
- Just a package with lots of files and testing stuff with moveit and grasping related things with REEM☆11Feb 4, 2016Updated 10 years ago
- Code used in the paper "On dynamic succinct graph representations".☆11Sep 2, 2021Updated 4 years ago
- BinDex: A Two-Layered Index for Fast and Robust Scans (SIGMOD2020)☆10Jun 5, 2020Updated 5 years ago
- Simple, open source utility to convert CSV/TSV files to RDF☆14Aug 6, 2014Updated 11 years ago
- neuralpy - neural network library written in python☆12Jun 25, 2023Updated 2 years ago
- Code for "So similar and yet incompatible: Toward the automated identification of semantically compatible words" in NAACL 2015 proceedi…☆11May 11, 2015Updated 10 years ago
- ☆11Jun 5, 2023Updated 2 years ago
- ☆13Jan 21, 2022Updated 4 years ago