CSKrishna / Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting
We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting.
☆12Updated 7 years ago
Alternatives and similar repositories for Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting
Users who are interested in Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting are comparing it to the libraries listed below
- The three algorithms used to solve Bayesian Stackelberg Games have been implemented here.☆27Updated 6 years ago
- ☆26Updated 4 years ago
- ☆20Updated 5 years ago
- Combining Evolutionary Algorithms and deep Reinforcement Learning☆16Updated 6 years ago
- Dynamic Pricing BwK Problem and Reinforcement Learning☆31Updated 6 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 5 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 7 years ago
- ☆10Updated 4 years ago
- Hierarchical deep reinforcement learning for combinatorial optimization problem☆35Updated 5 years ago
- ☆11Updated 5 years ago
- ☆16Updated 6 years ago
- Fully Cooperative Multi-Agent Deep Reinforcement Learning☆27Updated 5 years ago
- qmix☆22Updated 5 years ago
- Reinforcement Learning (RL) allows you to develop smart, quick and self-learning systems in your business surroundings. It is an effecti…☆11Updated 5 years ago
- Implementing Algorithms for Computing Stackelberg Equilibria in Security Games☆42Updated 8 years ago
- This is the code repository for the paper "Zero-Sum Stochastic Stackelberg Games".☆15Updated 2 years ago
- Multi-objective reinforcement learning deals with finding policies for tasks where there are multiple distinct criteria to optimize for. …☆22Updated 6 years ago
- Solves a Mixed Integer Linear Program to generate the Stackelberg Equilibrium of General-sum (+Bayesian) Games.☆36Updated 5 years ago
- A Multi-agent Learning Framework☆62Updated 4 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning☆47Updated 6 years ago
- Code for "A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising" WSDM 2022☆23Updated 3 years ago
- ☆17Updated 3 years ago
- Implementation of basic reinforcement learning algorithms (Q-learning, SARSA, Policy iteration and Value Iteration) on benchmark RL MDPs …☆38Updated 9 years ago
- Fuzzy Deep Reinforcement Learning for autoscaling on clouds☆9Updated 4 years ago
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning☆23Updated 6 years ago
- FEN Code☆37Updated 5 years ago
- Reproduces DQN, REINFORCE, REINFORCE with baseline, one-step AC, QAC, QAC with shared network, PPO2, DDPG, TD3, SAC, SAC discrete, A2C, A3C☆19Updated 4 years ago
- JAX/Haiku implementation of "Auction Learning as a Two-Player Game"☆10Updated 11 months ago
- Link to paper: https://www.ssrn.com/abstract=3804655☆12Updated 3 years ago
- Materials for "RL for Inventory Optimization", Day 4 of the "RL for Operations Bootcamp", Kellogg School of Management, Northwestern Univ…☆16Updated 11 months ago