CSKrishna / Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-settingLinks
We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting
☆12Updated 7 years ago
Alternatives and similar repositories for Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting
Users that are interested in Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting are comparing it to the libraries listed below
Sorting:
- Solving POMDPs using exact and approximate methods☆14Updated 7 years ago
- The three algorithms used to solve Bayesian Stackelberg Games have been implemented here.☆28Updated 6 years ago
- ☆20Updated 5 years ago
- Explore the potential of recommendation system using reinforcement learning☆15Updated 5 years ago
- Dynamic Pricing BwK Problem and Reinforcement Learning☆31Updated 6 years ago
- ☆17Updated 3 years ago
- ☆26Updated 4 years ago
- Multi-objective reinforcement learning deals with finding policies for tasks where there are multiple distinct criteria to optimize for. …☆22Updated 6 years ago
- TF-Tile: an efficient sparse representation for real-valued data☆14Updated 2 years ago
- MIE424 Group Project: smart_predict_optimize☆14Updated 4 years ago
- Code release for AAAI 2020 paper "Smart Predict-and-Optimize for Hard Combinatorial Optimization Problems"☆39Updated last year
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 5 years ago
- Implementation of basic reinforcement learning algorithms (Q-learning, SARSA, Policy iteration and Value Iteration) on benchmark RL MDPs …☆38Updated 9 years ago
- ☆10Updated 4 years ago
- Ordered Preference Elicitation Strategies for Multi-Objective Decision Making using Gaussian Processes☆23Updated 6 years ago
- Reinforcement Learning (RL), allows you to develop smart, quick and self-learning systems in your business surroundings. It is an effecti…☆12Updated 5 years ago
- State Space Models for Reinforcement Learning in Tensorflow☆19Updated 6 years ago
- Package for building Market Segmentation Trees, Choice Model Trees, and Isotonic Regression Trees☆17Updated 2 years ago
- This is the source code of the paper "Inferring Complementary Products from Baskets and Browsing Sessions"☆11Updated 6 years ago
- Implementing Algorithms for Computing Stackelberg Equilibria in Security Games☆42Updated 8 years ago
- An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR…☆14Updated 3 years ago
- Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: F…☆17Updated 7 years ago
- Materials for "RL for Inventory Optimization", Day 4 of the "RL for Operations Bootcamp", Kellogg School of Management, Northwestern Univ…☆16Updated last year
- Reference implementations of (my) control algorithms for Markov Jump Linear Systems without mode observation.☆13Updated 8 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 8 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 7 years ago
- Combining Evolutionary Algorithms and deep Reinforcement Learning☆16Updated 6 years ago
- A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preferenc…☆31Updated last year
- Link to paper: https://www.ssrn.com/abstract=3804655☆13Updated 3 years ago
- Reinforcement Learning for Optimal inventory policy☆30Updated 3 years ago