Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning
☆29Dec 28, 2017Updated 8 years ago
Alternatives and similar repositories for hip-mdp-public
Users that are interested in hip-mdp-public are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Project on Successor Features in Deep Reinforcement Learning and Transfer Learning☆24Feb 5, 2018Updated 8 years ago
- Code and webpages for our study on teaching humans to defer to an AI☆12Nov 6, 2023Updated 2 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 7 years ago
- The code is used to Plan UAV base stations and optimize their location for wider coverage in 5G and beyond Networks☆11Jul 7, 2022Updated 3 years ago
- Exploration based Reinforcement Learning. (Montezuma Revenge)☆14Jul 23, 2018Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Constrained episodic reinforcement learning in concave-convex and knapsack settings☆11Oct 3, 2023Updated 2 years ago
- Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)☆11May 19, 2021Updated 5 years ago
- ☆11Jul 1, 2019Updated 6 years ago
- ☆69May 26, 2018Updated 8 years ago
- A python framework for Optimal Planning Modulo Theories☆12Jan 26, 2024Updated 2 years ago
- Deepmind Recurrent Environment Simulators paper implementation in tensorflow☆74Feb 2, 2018Updated 8 years ago
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…☆12Mar 17, 2021Updated 5 years ago
- Simple gym environments for safety in Reinforcement Learning Research☆18Jul 17, 2024Updated last year
- Relational Features for Planning☆15Mar 27, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networkst"☆14Sep 12, 2022Updated 3 years ago
- ☆10Jun 10, 2020Updated 6 years ago
- A set of Deep Reinforcement Learning Agents implemented in Tensorflow.☆13Feb 5, 2017Updated 9 years ago
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 3 years ago
- pytorch, noisy_distributional_double_dueling_PER_RNN_CNN...CartPole-v1 , Acrobot-v1, MountainCar-v0☆14Mar 19, 2018Updated 8 years ago
- Code that translates grammar into PDDL, runs a planner to produce multiple plans, translates plans into trainable lale pipelines and trai…☆19Sep 17, 2025Updated 9 months ago
- Deep Reinforcement Learning framework that uses GNN to solve planning tasks for infrastructural assets☆17Jan 15, 2022Updated 4 years ago
- Transfer Learning in Reinforcement Learning using Stable-Baseline3 | Transfer Reinforcement Learning for Differing Action Spaces via Q-Ne…☆22Feb 27, 2022Updated 4 years ago
- Memory-augmented Attention Modelling for Videos☆10Apr 24, 2017Updated 9 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repository contains the code for RL for POMDPs through learning an Approximate Information State.☆23Nov 29, 2025Updated 6 months ago
- Template-DQN and DRRN agent implementations☆22Jun 12, 2023Updated 3 years ago
- Design and Implement a Satellite Communications System for Poor Quality Satellite Communication Channels using MATLAB☆24Mar 24, 2020Updated 6 years ago
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Mar 1, 2023Updated 3 years ago
- Probabilistic single-cell pseudotime with Edward+Tensorflow☆12Oct 5, 2017Updated 8 years ago
- Code repo for "Collapsing Bandits and Their Applications to Public Health Interventions", (NeurIPS'20)☆11Dec 3, 2025Updated 6 months ago
- Modelling epidemiological dynamics and performing inference in these models☆27Jul 30, 2021Updated 4 years ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆31Dec 10, 2022Updated 3 years ago
- Implementation of the Gaussian processes regression with inducing points for online data with ensemble Kalman filter estimation. Code for…☆16Jul 9, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460☆52Jul 25, 2016Updated 9 years ago
- ☆17May 1, 2021Updated 5 years ago
- InspectOmop is a lightweight python 3 package that assists in the extraction of electronic health record(EHR) data from relational databa…☆15Apr 14, 2026Updated 2 months ago
- Mobile Edge Computing Hierarchical Model which has the mobile as the edge server and the cloud as the central server. The load balancing …☆15May 5, 2018Updated 8 years ago
- Non-stationary Off-policy Evaluation☆13Nov 8, 2018Updated 7 years ago
- ☆20Nov 8, 2022Updated 3 years ago
- implementation of semi-supervised VAE using pytorch☆11Nov 5, 2019Updated 6 years ago