Lunj12 / RL-Bandits-with-KnapsacksLinks
Dynamic Pricing BwK Problem and Reinforcement Learning
☆31Updated 6 years ago
Alternatives and similar repositories for RL-Bandits-with-Knapsacks
Users that are interested in RL-Bandits-with-Knapsacks are comparing it to the libraries listed below
Sorting:
- ☆26Updated 4 years ago
- ☆14Updated last year
- Link to paper: https://www.ssrn.com/abstract=3804655☆12Updated 3 years ago
- The three algorithms used to solve Bayesian Stackelberg Games have been implemented here.☆28Updated 6 years ago
- Integration of DNN framework with Stochastic Multi-echelon Inventory Optimization (SMEIO)☆7Updated 4 years ago
- An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR…☆14Updated 3 years ago
- ☆35Updated 5 years ago
- ☆19Updated 3 years ago
- Implementation of inventory control policy parameters computation algorithms☆19Updated 3 years ago
- Bandit algorithms for dynamic pricing of many products☆42Updated 5 years ago
- Reinforcement learning environment for job scheduling written in python.☆25Updated 5 years ago
- Replication Code for Paper "Stochastic Optimization Forests".☆20Updated 3 years ago
- ☆10Updated 3 years ago
- ☆16Updated 6 years ago
- Hierarchical deep reinforcement learning for combinatorial optimization problem☆35Updated 5 years ago
- Order Fulfillment by Multi-Agent Reinforcement Learning☆23Updated last year
- Code release for AAAI 2020 paper "Smart Predict-and-Optimize for Hard Combinatorial Optimization Problems"☆38Updated 11 months ago
- Combining Evolutionary Algorithms and deep Reinforcement Learning☆16Updated 6 years ago
- Materials for "RL for Inventory Optimization", Day 4 of the "RL for Operations Bootcamp", Kellogg School of Management, Northwestern Univ…☆16Updated last year
- A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preferenc…☆31Updated last year
- Code for paper publication: Deep reinforcement learning-based solution for a multi-objective online order batching problem☆14Updated 3 years ago
- Code for Dynamic Weights in Multi-Objective Deep Reinforcement Learning☆95Updated 2 years ago
- In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandit…☆87Updated 4 years ago
- MIE424 Group Project: smart_predict_optimize☆14Updated 4 years ago
- ☆26Updated 4 years ago
- ☆25Updated 3 years ago
- A Python library for addressing the supply chain inventory management problem using deep reinforcement learning algorithms.☆91Updated last year
- ☆43Updated last month
- ☆40Updated 3 months ago
- The source code for the paper: 'ORL: Reinforcement Learning Benchmarks for Online Stochastic Optimization Problems'☆85Updated 4 years ago