CSKrishna / Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting
We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting
☆11 Updated 6 years ago
Alternatives and similar repositories for Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting:
Users who are interested in Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting are comparing it to the repositories listed below
- Multi-objective reinforcement learning deals with finding policies for tasks where there are multiple distinct criteria to optimize for. … ☆20 Updated 6 years ago
- The three algorithms used to solve Bayesian Stackelberg Games have been implemented here. ☆25 Updated 6 years ago
- Implementation of basic reinforcement learning algorithms (Q-learning, SARSA, Policy iteration and Value Iteration) on benchmark RL MDPs … ☆37 Updated 8 years ago
- Integration of DNN framework with Stochastic Multi-echelon Inventory Optimization (SMEIO) ☆8 Updated 3 years ago
- qmix ☆22 Updated 4 years ago
- Fully Cooperative Multi-Agent Deep Reinforcement Learning ☆25 Updated 5 years ago
- Dynamic Pricing BwK Problem and Reinforcement Learning ☆29 Updated 6 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning ☆46 Updated 6 years ago
- Deep Reinforcement Learning for Multi Agent Soccer ☆17 Updated 8 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments" ☆17 Updated 5 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning ☆18 Updated 7 years ago
- Multiagent deep reinforcement learning research project ☆27 Updated 7 months ago
- Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising ☆26 Updated 4 years ago
- Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020) ☆11 Updated 3 years ago
- meta-MADDPG (Python implementation) ☆18 Updated 6 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874 ☆45 Updated 4 years ago
- Link to paper: https://www.ssrn.com/abstract=3804655 ☆13 Updated 3 years ago
- Combining Evolutionary Algorithms and deep Reinforcement Learning ☆15 Updated 6 years ago
- PyTorch implementation of various reinforcement learning algorithms ☆18 Updated 6 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis ☆28 Updated 6 years ago
- A Multi-agent Learning Framework ☆62 Updated 3 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG ☆64 Updated 5 years ago
- Multi Agent Deep Reinforcement Learning for Local Flexibility Markets- Master Thesis ☆11 Updated 4 years ago
- Reinforcement Learning (RL) allows you to develop smart, quick and self-learning systems in your business surroundings. It is an effecti… ☆11 Updated 5 years ago
- Materials for "RL for Inventory Optimization", Day 4 of the "RL for Operations Bootcamp", Kellogg School of Management, Northwestern Univ… ☆15 Updated 6 months ago
- Efficient Exploration through Bayesian Deep Q-Networks ☆37 Updated 6 years ago
- Tensorflow Implementation for "Noisy network for exploration" ☆32 Updated 7 years ago