Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning
☆29Dec 28, 2017Updated 8 years ago
Alternatives and similar repositories for hip-mdp-public
Users that are interested in hip-mdp-public are comparing it to the libraries listed below.
- The Unsupervised Phenome Model (UPhenome) for learning phenotypes on heterogeneous EHR data☆16Dec 9, 2015Updated 10 years ago
- A PyTorch implementation of Bootstrapped DQN☆13Dec 6, 2020Updated 5 years ago
- Project on Successor Features in Deep Reinforcement Learning and Transfer Learning☆24Feb 5, 2018Updated 8 years ago
- Code and webpages for our study on teaching humans to defer to an AI☆12Nov 6, 2023Updated 2 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- Exploration based Reinforcement Learning. (Montezuma Revenge)☆14Jul 23, 2018Updated 7 years ago
- Constrained episodic reinforcement learning in concave-convex and knapsack settings☆11Oct 3, 2023Updated 2 years ago
- Code for SPIBB-DQN and Soft-SPIBB-DQN☆11May 5, 2020Updated 5 years ago
- ☆11Jul 1, 2019Updated 6 years ago
- ☆69May 26, 2018Updated 7 years ago
- IOT Communication for Satellite tracking system☆12Sep 3, 2019Updated 6 years ago
- Implementation of DeepMind's Recurrent Environment Simulators paper in TensorFlow☆74Feb 2, 2018Updated 8 years ago
- Code for "Neural Network-based Reconstruction in Compressed Sensing MRI Without Fully-sampled Training Data"☆12Jan 5, 2021Updated 5 years ago
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…☆12Mar 17, 2021Updated 5 years ago
- Relational Features for Planning☆15Mar 27, 2026Updated 3 weeks ago
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networks"☆14Sep 12, 2022Updated 3 years ago
- ☆10Jun 10, 2020Updated 5 years ago
- Code for "Convergence of Learning Dynamics in Stackelberg Games"☆13Nov 6, 2019Updated 6 years ago
- A set of Deep Reinforcement Learning Agents implemented in Tensorflow.☆13Feb 5, 2017Updated 9 years ago
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 3 years ago
- pytorch, noisy_distributional_double_dueling_PER_RNN_CNN...CartPole-v1 , Acrobot-v1, MountainCar-v0☆14Mar 19, 2018Updated 8 years ago
- Deep Reinforcement Learning framework that uses GNN to solve planning tasks for infrastructural assets☆17Jan 15, 2022Updated 4 years ago
- Transfer Learning in Reinforcement Learning using Stable-Baseline3 | Transfer Reinforcement Learning for Differing Action Spaces via Q-Ne…☆22Feb 27, 2022Updated 4 years ago
- This repository contains the code for RL for POMDPs through learning an Approximate Information State.☆22Nov 29, 2025Updated 4 months ago
- Template-DQN and DRRN agent implementations☆22Jun 12, 2023Updated 2 years ago
- Repository for tutorial on Neural ODEs prepared for the UCL AI Society☆13Mar 7, 2021Updated 5 years ago
- The TensorFlow-KR group runs a paper-reading meetup called PR12. This project aims to implement the models covered there in Keras.☆17Feb 4, 2018Updated 8 years ago
- ☆10Jun 20, 2025Updated 9 months ago
- Code for the paper "A Fully Hyperbolic Neural Model for Hierarchical Multi-class Classification"☆16Nov 17, 2020Updated 5 years ago
- Design and Implement a Satellite Communications System for Poor Quality Satellite Communication Channels using MATLAB☆24Mar 24, 2020Updated 6 years ago
- Integrate AutoRL into DQN to implement a single traffic signal control system.☆16Nov 16, 2023Updated 2 years ago
- Modelling epidemiological dynamics and performing inference in these models☆27Jul 30, 2021Updated 4 years ago
- Implementation of the Model-Free Episodic Control paper by DeepMind: http://arxiv.org/abs/1606.04460☆52Jul 25, 2016Updated 9 years ago
- A boilerplate (dbs, envs, teleop, models, web-apps) for robotic learning experiments & a Pytorch Implementation of "Learning Latent Plans…☆11Oct 23, 2020Updated 5 years ago
- A helper package to get information of scholarly articles from DBLP using its public API☆16May 13, 2025Updated 11 months ago
- InspectOmop is a lightweight python 3 package that assists in the extraction of electronic health record(EHR) data from relational databa…☆15Updated this week
- Mobile Edge Computing Hierarchical Model which has the mobile as the edge server and the cloud as the central server. The load balancing …☆15May 5, 2018Updated 7 years ago
- Non-stationary Off-policy Evaluation☆13Nov 8, 2018Updated 7 years ago