Bayes-Adaptive Monte-Carlo Planning algorithm
☆17Mar 5, 2013Updated 13 years ago
Alternatives and similar repositories for bamcp
Users that are interested in bamcp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Mar 1, 2023Updated 3 years ago
- Code for the paper "AlwaysSafe: Reinforcement Learning Without Safety Constraint Violations During Training"☆17May 9, 2022Updated 3 years ago
- A toolkit for working with RDDL domains in Python3.☆17Nov 7, 2020Updated 5 years ago
- Model-based reinforcement learning in TensorFlow☆56Jul 27, 2021Updated 4 years ago
- Julia Implementation of the POMCP algorithm for solving POMDPs☆12Aug 6, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A community repository for benchmarking Bayesian methods☆11May 25, 2023Updated 2 years ago
- ☆15Oct 20, 2020Updated 5 years ago
- Factored model-based Bayesian Reinforcement Learning framework☆22Nov 23, 2022Updated 3 years ago
- The PO-UCT algorithm (aka POMCP) implemented in Julia☆37Nov 16, 2025Updated 4 months ago
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 3 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- ☆11Jul 15, 2022Updated 3 years ago
- Code for Deep Structured Mixtures of Gaussian Processes (DSMGPs)☆11Jan 27, 2022Updated 4 years ago
- ☆13May 30, 2019Updated 6 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- A helper package to get information of scholarly articles from DBLP using its public API☆15May 13, 2025Updated 10 months ago
- [MLHC 2021] Model Selection for Offline RL: Practical Considerations for Healthcare Settings. https://arxiv.org/abs/2107.11003☆10Oct 6, 2022Updated 3 years ago
- Non-stationary Off-policy Evaluation☆13Nov 8, 2018Updated 7 years ago
- Generate taylored code for Differential Dynamic Programming (DDP) aka Iterative Linear Quadratic Gaussian (iLQG) solvers for finite time …☆15Mar 27, 2018Updated 7 years ago
- Code for "On the Expressiveness of Approximate Inference in Bayesian Neural Networks"☆13Aug 16, 2021Updated 4 years ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆38Jan 16, 2023Updated 3 years ago
- Constrained episodic reinforcement learning in concave-convex and knapsack settings☆11Oct 3, 2023Updated 2 years ago
- Model-Free Episodic Control☆14Jan 12, 2017Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A tour of Pomdpland☆10Aug 10, 2022Updated 3 years ago
- Code to related to my NIPS 2016 paper☆10Dec 4, 2016Updated 9 years ago
- This repo contains the scripts used to create the data for the ATC2020 paper "Reconstructing proprietary video streaming algorithms"☆14Mar 24, 2021Updated 5 years ago
- A web page to collect reproduced papers in one place with their codes☆14Mar 8, 2023Updated 3 years ago
- Examples of Verbalized Machine Learning (VML)☆16Mar 16, 2025Updated last year
- Repository for Iterated Relearning: The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning☆11Jun 8, 2020Updated 5 years ago
- Using RLLib and PycoLab to explore intelligent cooperative behavior in sequential social dilemmas☆54Dec 8, 2022Updated 3 years ago
- ☆11Dec 27, 2021Updated 4 years ago
- Repository containing the PhD Thesis "Formal Verification of Deep Reinforcement Learning Agents"☆11Aug 29, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Platform for training generalizable deep reinforcement learning agents☆13Mar 4, 2026Updated 3 weeks ago
- Real-time Bandwidth Prediction based on LSTM☆10Mar 19, 2025Updated last year
- ☆11Apr 7, 2021Updated 4 years ago
- Implementation of ``Actor-Critic Alignment for Offline-to-Online Reinforcement Learning''☆13Oct 12, 2023Updated 2 years ago
- A downloadable pdf containing summary of frequently used pandas operations.☆10Sep 26, 2020Updated 5 years ago
- This is the official repository for "DiffSG: A Generative Solver for Network Optimization with Diffusion Model" and "Diffusion Models as …☆19Feb 10, 2025Updated last year
- Tool kit for the Mujoco user☆14May 12, 2020Updated 5 years ago