Markov Decision Processes, Dynamic Programming and Reinforcement Learning
☆306 · Feb 22, 2021 · Updated 5 years ago
Alternatives and similar repositories for MDP-DP-RL
Users that are interested in MDP-DP-RL are comparing it to the libraries listed below.
- Technical documents on a variety of topics, created for the purpose of learning ☆63 · Mar 15, 2026 · Updated 3 months ago
- Markov Decision Process (MDP) Toolbox for Python ☆556 · Jun 4, 2023 · Updated 3 years ago
- Python code for Markov decision processes ☆20 · Nov 5, 2018 · Updated 7 years ago
- Approximate Dynamic Programming and Reinforcement Learning - Programming Assignment ☆10 · Jun 21, 2019 · Updated 6 years ago
- A MATLAB Toolbox for Solving Markov Decision Problems with Dynamic Programming ☆35 · Nov 2, 2021 · Updated 4 years ago
- Approximate Dynamic Programming for Portfolio Selection Problem ☆56 · Dec 8, 2022 · Updated 3 years ago
- ☆11 · Aug 25, 2022 · Updated 3 years ago
- Conflict avoidance algorithm for unmanned aircraft traffic management ☆10 · May 30, 2017 · Updated 9 years ago
- Multi-objective vehicle routing problem with soft time window constraints ☆10 · May 30, 2022 · Updated 4 years ago
- Macro-Action Generator-Critic (MAGIC) - Learning Macro-actions for online POMDP planning ☆17 · Feb 23, 2023 · Updated 3 years ago
- A DQN agent that optimally hedges an options portfolio. ☆25 · Feb 4, 2020 · Updated 6 years ago
- Python implementation of the basic model described in Chan, Nicholas Tung, and Christian Shelton. "An electronic market-maker." ☆17 · Aug 27, 2023 · Updated 2 years ago
- Re-produce DQN, REINFORCE, REINFORCE with baseline, one-step AC, QAC, QAC with shared network, PPO2, DDPG, TD3, SAC, SAC discrete, A2C, A3C ☆21 · Jul 27, 2020 · Updated 5 years ago
- "Adaptive Cruise Control for a Hybrid Vehicle with Deep Policy Gradients". Final project for ECE 517/414 Reinforcement Learning. ☆13 · Dec 8, 2021 · Updated 4 years ago
- A C++ framework for MDPs and POMDPs with Python bindings ☆671 · Mar 18, 2025 · Updated last year
- Experiments for designing tree search algorithms for Continuous POMDPs ☆20 · Nov 7, 2020 · Updated 5 years ago
- Approximate Dynamic Programming exercises from Powell (2011) ☆15 · Jul 6, 2023 · Updated 2 years ago
- Adaptation of Monte Carlo and SARSA algorithms (Reinforcement Learning) for learning the policy of sellers/buyers in stock market ☆12 · Jul 23, 2018 · Updated 7 years ago
- Code for Deep Neural Network Approximated Dynamic Programming ☆17 · Jul 14, 2020 · Updated 5 years ago
- A theano-based LSTM-RNN for trajectory prediction. ☆11 · May 16, 2018 · Updated 8 years ago
- Extensions to Philadelphia ☆17 · Updated this week
- Predictive Maintenance avoids the drawbacks of Preventive Maintenance (under utilization of a part's life) and Reactive Maintenance (unsc… ☆17 · Apr 25, 2022 · Updated 4 years ago
- Code for my Master's thesis, game theory for adversarial autonomous vehicle platooning scenarios ☆15 · Apr 28, 2023 · Updated 3 years ago
- A generalized experience replay buffer for reinforcement learning ☆10 · Apr 4, 2025 · Updated last year
- ☆50 · Jul 22, 2020 · Updated 5 years ago
- Deep Recurrent Q-Learning vs Deep Q Learning on a simple Partially Observable Markov Decision Process with Minecraft ☆48 · Apr 12, 2019 · Updated 7 years ago
- Utilizing dynamic programming, I optimally schedule electric battery usage for a plug-in hybrid electric vehicle to minimize fuel consump… ☆48 · Jan 17, 2019 · Updated 7 years ago
- Online solver based on Monte Carlo tree search for POMDPs with continuous state, action, and observation spaces. ☆60 · Nov 16, 2025 · Updated 7 months ago
- Option hedging strategies are investigated using two reinforcement learning algorithms: deep Q network and deep deterministic policy grad… ☆20 · Nov 28, 2019 · Updated 6 years ago
- Deep Q-Learning for Market Making ☆130 · Jun 12, 2018 · Updated 8 years ago
- ☆13 · Jul 25, 2019 · Updated 6 years ago
- Implementation of the VIPER algorithm introduced in "Verifiable Reinforcement Learning via Policy Extraction" by Bastani et al. ☆21 · Nov 9, 2025 · Updated 7 months ago
- Battery charge management environment, designed as a multi-agent scenario with continuous observation and action space, where the agents … ☆13 · Feb 9, 2021 · Updated 5 years ago
- ☆35 · Dec 9, 2021 · Updated 4 years ago
- 2 algorithms of optimal trade execution: 1) Dynamic Programming 2) Frank-Wolfe Algorithm (Python & C++) ☆19 · Dec 11, 2019 · Updated 6 years ago
- Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks. ☆69 · Oct 29, 2017 · Updated 8 years ago
- Prediction of Remaining Useful Life (RUL) of NASA Turbofan Jet Engine using libraries such as Numpy, Matplotlib and Pandas. Prediction is… ☆10 · Apr 18, 2021 · Updated 5 years ago
- A HMM application in Kritzman Regime Detection ☆15 · Jan 3, 2020 · Updated 6 years ago
- Collection of business analytics case studies that leverage data science methods to create business value (R and Python) ☆13 · Jul 12, 2019 · Updated 6 years ago