Unified notation for Markov Decision Processes ((PO)MDPs)
☆24Apr 27, 2018Updated 7 years ago
Alternatives and similar repositories for MDPN
Users that are interested in MDPN are comparing it to the libraries listed below.
- A variation graph tool☆10Dec 23, 2019Updated 6 years ago
- ☆27Mar 11, 2025Updated last year
- This repository contains the code used in the paper Evaluating the Performance of Reinforcement Learning Algorithms☆27Aug 14, 2021Updated 4 years ago
- ☆29Mar 1, 2026Updated last month
- Experiment utility code, specifically designed for use with Compute Canada.☆11Jan 27, 2025Updated last year
- Code and data for the CoNLL 2018 paper "Adversarially Regularising Neural NLI Models to Integrate Logical Background Knowledge."☆25Jan 21, 2019Updated 7 years ago
- Autonomous exploration, active learning and human guidance with open-source Poppy humanoid robot platform and Explauto library☆18May 22, 2018Updated 7 years ago
- Recurrent Neural Network library for Torch7's nn☆19Jan 26, 2017Updated 9 years ago
- Recurrent Convolutional Memory Network (in progress)☆29Apr 16, 2016Updated 10 years ago
- ☆114Aug 6, 2024Updated last year
- Ant Gather and Ant Maze envs, separated from RLLab☆11Aug 2, 2018Updated 7 years ago
- An environment for tabular Reinforcement Learning agents.☆14Jun 13, 2018Updated 7 years ago
- Bannerlord mod that allows the player to marry faction leaders and companions.☆11Nov 1, 2025Updated 5 months ago
- Identifying disease types in images of rice grown in Egypt.☆26Aug 15, 2022Updated 3 years ago
- Source code for the Self-Paced Deep Reinforcement Learning Experiments☆31Mar 24, 2023Updated 3 years ago
- Implementation of Receding Horizon Curiosity Algorithm☆13Mar 24, 2023Updated 3 years ago
- [NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare. https://arxiv.org/abs/2305.01738☆11Nov 27, 2022Updated 3 years ago
- Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)☆11May 19, 2021Updated 4 years ago
- Open source Java framework to create, process and manage mixtures of exponential family☆14Aug 4, 2015Updated 10 years ago
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms"☆29Oct 23, 2025Updated 5 months ago
- This repository contains my models that have been trained to translate from Kikuyu to Kiswahili. It also contains the dataset used for the…☆14Sep 10, 2018Updated 7 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.☆38Oct 14, 2020Updated 5 years ago
- Repository for the paper "An Adversarial Approach for the Robust Classification of Pneumonia from Chest Radiographs"☆19Jan 14, 2020Updated 6 years ago
- ☆14Jun 7, 2024Updated last year
- Evaluating methods to improve model transfer for intensive care unit models☆16Jul 6, 2023Updated 2 years ago
- ☆17Mar 21, 2021Updated 5 years ago
- PyTorch implementation of Count-Based Exploration with Neural Density Models☆10Mar 22, 2018Updated 8 years ago
- ☆13Aug 19, 2024Updated last year
- Jump to better conclusions: SCAN both left and right☆11Jan 24, 2019Updated 7 years ago
- Blocks World -- Simulator, Code, and Models (Misra et al. EMNLP 2017)☆40Feb 7, 2019Updated 7 years ago
- Wikipedia navigation environment for OpenAI Gym☆42Apr 2, 2023Updated 3 years ago
- OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.☆62Aug 9, 2022Updated 3 years ago
- ParlAI Agent examples with PyTorch, Chainer and TensorFlow☆46Jan 19, 2018Updated 8 years ago
- GitHub for the NIPS 2020 paper "Learning outside the black-box: at the pursuit of interpretable models"☆14Sep 7, 2022Updated 3 years ago
- Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning☆20Jan 11, 2023Updated 3 years ago
- A bot that can send messages, images, and stickers to LINE.☆10Jan 3, 2022Updated 4 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Aug 2, 2018Updated 7 years ago
- Code to reproduce the results of "Curiosity Driven Exploration of Learned Disentangled Goal Spaces"☆19Oct 26, 2018Updated 7 years ago
- Official repo of "Towards Interpretable Protein Structure Prediction with Sparse Autoencoders" published at ICLR 2025 GEM workshop.☆16Mar 13, 2025Updated last year