Dynamic Pricing BwK Problem and Reinforcement Learning
☆31Dec 11, 2018Updated 7 years ago
Alternatives and similar repositories for RL-Bandits-with-Knapsacks
Users that are interested in RL-Bandits-with-Knapsacks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Bandit algorithms for dynamic pricing of many products☆42Nov 5, 2019Updated 6 years ago
- Reinforcement Learning for Supply Chain Optimization☆15Feb 3, 2020Updated 6 years ago
- ☆38Mar 28, 2022Updated 4 years ago
- ☆14Jun 8, 2023Updated 3 years ago
- ☆23Nov 17, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Constrained episodic reinforcement learning in concave-convex and knapsack settings☆11Oct 3, 2023Updated 2 years ago
- A lightweight reimplementation of Adversarially Trained Actor Critic☆19Mar 19, 2026Updated 3 months ago
- Dynamic pricing for selling perishable goods☆65Dec 7, 2017Updated 8 years ago
- In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandit…☆90May 27, 2026Updated 3 weeks ago
- Contextual bandit algorithm called LinUCB / Linear Upper Confidence Bounds as proposed by Li, Langford and Schapire☆33Feb 2, 2023Updated 3 years ago
- [ICML 25] "Preference Optimization for Combinatorial Optimization Problems"☆28Jun 6, 2025Updated last year
- python openflow library☆14Oct 17, 2018Updated 7 years ago
- [ICML 2023] Official code for "DevFormer: A Symmetric Transformer for Context-Aware Device Placement"☆22Dec 7, 2024Updated last year
- Thompson Sampling for Bandits using UCB policy☆10Jul 29, 2017Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Thesis on Single-Agent Dynamic Pricing with Reinforcement Learning☆24Jun 14, 2019Updated 7 years ago
- Source code for "Influencing Long-Term Behavior in Multiagent Reinforcement Learning" (NeurIPS 2022)☆19Jan 1, 2023Updated 3 years ago
- Simulation Framework for Dynamic Pricing Competitions☆20Aug 24, 2018Updated 7 years ago
- EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks☆18Jan 10, 2020Updated 6 years ago
- Reinforcement Learning, Tutorials in Chinese☆11Jun 9, 2018Updated 8 years ago
- Environments for OR and RL Research☆445Oct 12, 2023Updated 2 years ago
- This repository is the official implementation of Bidirectional Learning for Offline Infinite-width Model-based Optimization (NeurIPS 202…☆14Jan 19, 2023Updated 3 years ago
- Lipschitz Lifelong RL☆11Nov 6, 2020Updated 5 years ago
- Bootstrap (Linear) Thompson Sampling☆13Jun 30, 2016Updated 9 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆12Jan 31, 2022Updated 4 years ago
- ☆10Jun 7, 2021Updated 5 years ago
- ☆41Apr 17, 2022Updated 4 years ago
- Validation for an online resource allocation algorithm for mobile edge computing☆20Dec 5, 2016Updated 9 years ago
- Scrape LinkedIn posts and content based on keywords☆17Oct 20, 2021Updated 4 years ago
- This is the source code for our (Matthias Jasny, Lasse Thostrup, Tobias Ziegler and Carsten Binnig) published paper at SIGMOD’22: P4DB - …☆13Jan 24, 2023Updated 3 years ago
- Three Agent-Based Simulation for Edge Computing in 5G and Beyond for the recent paper titled "Design and Simulation of a Hybrid Architect…☆20Oct 26, 2021Updated 4 years ago
- Pytorch Implementation for KDD22 paper "Multi-Agent Graph Convolutional Reinforcement Learning for Dynamic Electric Vehicle Charging Pric…☆76Mar 9, 2023Updated 3 years ago
- The source code for the paper: 'ORL: Reinforcement Learning Benchmarks for Online Stochastic Optimization Problems'☆83May 27, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Implementation of PatchAIL in the ICLR 2023 paper <Visual Imitation with Patch Rewards>☆14Feb 15, 2023Updated 3 years ago
- ☆22Jul 20, 2022Updated 3 years ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decision☆103Dec 14, 2021Updated 4 years ago
- Julia interface for the Quadratic Programming solver DAQP☆13Mar 28, 2026Updated 2 months ago
- Energy-Efficient Power and Subcarrier Allocation for OFDMA Systems with Value Function Approximation Approach. EI paper from march to sep…☆13Mar 13, 2017Updated 9 years ago
- Code implementation of "Information Design in Multi-Agent Reinforcement Learning"☆16Aug 18, 2023Updated 2 years ago
- Source code of paper Combinatorial Optimization Meets Reinforcement Learning: Effective Taxi Order Dispatching at Large-Scale (TKDE 2022)…☆26Apr 7, 2022Updated 4 years ago