CSKrishna / Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting
We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting
☆12Updated 7 years ago
Alternatives and similar repositories for Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting
Users that are interested in Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting are comparing it to the libraries listed below
Sorting:
- The three algorithms used to solve Bayesian Stackelberg Games have been implemented here.☆26Updated 6 years ago
- Dynamic Pricing BwK Problem and Reinforcement Learning☆31Updated 6 years ago
- ☆26Updated 4 years ago
- Multi-objective reinforcement learning deals with finding policies for tasks where there are multiple distinct criteria to optimize for. …☆22Updated 6 years ago
- ☆20Updated 5 years ago
- Fully Cooperative Multi-Agent Deep Reinforcement Learning☆26Updated 5 years ago
- Link to paper: https://www.ssrn.com/abstract=3804655☆12Updated 3 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 7 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 5 years ago
- Multiagent deep reinforcement learning research project☆28Updated 11 months ago
- The released model of the paper 'Automatic Bridge Bidding by Deep Reinforcement Learning' in ECAI 2016☆19Updated 8 years ago
- Materials for "RL for Inventory Optimization", Day 4 of the "RL for Operations Bootcamp", Kellogg School of Management, Northwestern Univ…☆15Updated 10 months ago
- research and implementations of Deep RL agents and their applications☆50Updated 2 weeks ago
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning☆23Updated 6 years ago
- Feature selection for maximizing expected cumulative reward☆30Updated 7 years ago
- Code for "A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising" WSDM 2022☆22Updated 3 years ago
- ☆11Updated 5 years ago
- Non stationary bandit for experiments with Reinforcement Learning☆34Updated 8 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 7 years ago
- State Space Models for Reinforcement Learning in Tensorflow☆19Updated 6 years ago
- Integration of DNN framework with Stochastic Multi-echelon Inventory Optimization (SMEIO)☆7Updated 4 years ago
- A deep reinforcement learning approach to search engine ranking (PyTorch). Final Project for UC Berkeley's CS 285: Deep Reinforcement Lea…☆27Updated last year
- Implementation of basic reinforcement learning algorithms (Q-learning, SARSA, Policy iteration and Value Iteration) on benchmark RL MDPs …☆38Updated 9 years ago
- Deep Gaussian Process for Inverse Reinforcement Learning☆33Updated 7 years ago
- Reinforcement Learning (RL), allows you to develop smart, quick and self-learning systems in your business surroundings. It is an effecti…☆11Updated 5 years ago
- PyTorch implementation of various reinforcement learning algorithms☆18Updated 7 years ago
- TF-Tile: an efficient sparse representation for real-valued data☆14Updated 2 years ago
- TD-Regularized Actor-Critic Methods☆36Updated 5 years ago
- Combining Evolutionary Algorithms and deep Reinforcement Learning☆16Updated 6 years ago
- Some example code for the "Introduction to Bayesian Reinforcement Learning" presentations☆29Updated 6 years ago