Lunj12 / RL-Bandits-with-Knapsacks
Dynamic Pricing BwK Problem and Reinforcement Learning
☆31Updated 6 years ago
Alternatives and similar repositories for RL-Bandits-with-Knapsacks
Users who are interested in RL-Bandits-with-Knapsacks are comparing it to the repositories listed below
- ☆15Updated last year
- ☆19Updated 3 years ago
- Link to paper: https://www.ssrn.com/abstract=3804655 ☆12Updated 3 years ago
- ☆26Updated 4 years ago
- Official codes for "Multi-Agent Deep Reinforcement Learning for Multi-Echelon Inventory Management: Reducing Costs and Alleviating Bullwh…☆34Updated 2 years ago
- Integration of DNN framework with Stochastic Multi-echelon Inventory Optimization (SMEIO)☆7Updated 4 years ago
- Implementation of inventory control policy parameters computation algorithms☆18Updated 3 years ago
- ☆10Updated 3 years ago
- The three algorithms used to solve Bayesian Stackelberg Games have been implemented here.☆26Updated 6 years ago
- A Python library for addressing the supply chain inventory management problem using deep reinforcement learning algorithms.☆90Updated last year
- MIE424 Group Project: smart_predict_optimize☆14Updated 4 years ago
- Code for the paper "Smart 'Predict, then Optimize'"☆75Updated 9 months ago
- An OpenAI Gym environment for Inventory Control problems☆55Updated 5 years ago
- ☆31Updated 2 years ago
- Reinforcement Learning for Optimal inventory policy☆26Updated 3 years ago
- Code for the AAAI 2019 paper "Melding the Data-Decisions Pipeline: Decision-Focused Learning for Combinatorial Optimization"☆31Updated 4 years ago
- Order Fulfillment by Multi-Agent Reinforcement Learning☆23Updated last year
- Code release for AAAI 2020 paper "Smart Predict-and-Optimize for Hard Combinatorial Optimization Problems"☆38Updated 10 months ago
- Reinforcement learning environment for job scheduling written in Python.☆25Updated 5 years ago
- Materials for "RL for Inventory Optimization", Day 4 of the "RL for Operations Bootcamp", Kellogg School of Management, Northwestern Univ…☆15Updated 10 months ago
- ☆9Updated last year
- ☆35Updated 5 years ago
- ☆26Updated 4 years ago
- A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preferenc…☆29Updated last year
- An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR…☆14Updated 3 years ago
- Learning to Branch in Mixed Integer Linear Programming with Graph Convolutional Neural Networks in Ecole☆19Updated 2 years ago
- ☆16Updated 6 years ago
- ☆54Updated 4 months ago
- ☆39Updated last month
- Inventory simulation modules for a single-echelon supply chain☆13Updated 6 years ago