dtak / hip-mdp-publicView external linksLinks
Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning
☆30Dec 28, 2017Updated 8 years ago
Alternatives and similar repositories for hip-mdp-public
Users that are interested in hip-mdp-public are comparing it to the libraries listed below
Sorting:
- Code and webpages for our study on teaching humans to defer to an AI☆12Nov 6, 2023Updated 2 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)☆11May 19, 2021Updated 4 years ago
- ☆11Jul 1, 2019Updated 6 years ago
- Code for "Neural Network-based Reconstruction in Compressed Sensing MRI Without Fully-sampled Training Data"☆13Jan 5, 2021Updated 5 years ago
- Constrained episodic reinforcement learning in concave-convex and knapsack settings☆11Oct 3, 2023Updated 2 years ago
- Project on Successor Features in Deep Reinforcement Learning and Transfer Learning☆24Feb 5, 2018Updated 8 years ago
- Code for SPIBB-DQN and Soft-SPIBB-DQN☆11May 5, 2020Updated 5 years ago
- This is pytorch implmentation project of Bootsrapped DQN☆13Dec 6, 2020Updated 5 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- The Unsupervised Phenome Model (UPhenome) for learning phenotypes on heterogeneous EHR data☆16Dec 9, 2015Updated 10 years ago
- Deep Reinforcement Learning framework that uses GNN to solve planning tasks for infrastructural assets☆17Jan 15, 2022Updated 4 years ago
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…☆12Mar 17, 2021Updated 4 years ago
- pytorch, noisy_distributional_double_dueling_PER_RNN_CNN...CartPole-v1 , Acrobot-v1, MountainCar-v0☆14Mar 19, 2018Updated 7 years ago
- Implementation of the Gaussian processes regression with inducing points for online data with ensemble Kalman filter estimation. Code for…☆17Jul 9, 2018Updated 7 years ago
- Exploration based Reinforcement Learning. (Montezuma Revenge)☆14Jul 23, 2018Updated 7 years ago
- Simple gym environments for safety in Reinforcement Learning Research☆18Jul 17, 2024Updated last year
- Code for the paper "A Fully Hyperbolic Neural Model for Hierarchical Multi-class Classification"☆17Nov 17, 2020Updated 5 years ago
- This repository contains the code for RL for POMDPs through learning an Approximate Information State.☆21Nov 29, 2025Updated 2 months ago
- A recurrent neural network heavily inspired by Long Short Term Memory, but simpler.☆21May 4, 2013Updated 12 years ago
- Mobile Edge Computing Hierarchical Model which has the mobile as the edge server and the cloud as the central server. The load balancing …☆15May 5, 2018Updated 7 years ago
- COOM: Benchmarking Continual Reinforcement Learning on Doom☆20Jan 4, 2026Updated last month
- ☆20Nov 8, 2022Updated 3 years ago
- Template-DQN and DRRN agent implementations☆22Jun 12, 2023Updated 2 years ago
- Software for selective inference☆18Apr 25, 2023Updated 2 years ago
- Multi-variable LSTM recurrent neural networks for prediction and interpretation of multi-variable time series☆49Jun 18, 2021Updated 4 years ago
- Code for HypMix EMNLP 2021 (main)☆24Oct 4, 2021Updated 4 years ago
- PyTorch implementation of various reinforcement learning algorithms☆18Feb 22, 2018Updated 7 years ago
- OncoText is an information extraction service for breast pathology reports. It supports over 20 categories including DCIS, includes pretr…☆24Oct 22, 2018Updated 7 years ago
- Resilient IoT Data Exchange (RIDE) using SDN and edge computing. This repository includes the algorithms, prototype implementation, SDN …☆23Mar 6, 2021Updated 4 years ago
- Public code for implementation and experiments with differentiable decision trees.☆32Oct 17, 2024Updated last year
- Transfer Learning in Reinforcement Learning using Stable-Baseline3 | Transfer Reinforcement Learning for Differing Action Spaces via Q-Ne…☆22Feb 27, 2022Updated 3 years ago
- NDN caching simulator for edge computing using python☆23Jul 13, 2025Updated 7 months ago
- Embedding graphs in symmetric spaces☆30Sep 30, 2021Updated 4 years ago
- Modelling epidemiological dynamics and performing inference in these models☆27Jul 30, 2021Updated 4 years ago
- Chainer implementation of Double Deep Q-Network (Double DQN)☆27Mar 30, 2016Updated 9 years ago
- Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460☆55Jul 25, 2016Updated 9 years ago
- GA,PSO,LSTM...☆26May 11, 2018Updated 7 years ago
- A repository for a Deep Q-Learning approach to intrusion detection for networks cyber-attacks.☆10Sep 3, 2021Updated 4 years ago