CSKrishna / Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting
We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting
☆12Updated 7 years ago
Alternatives and similar repositories for Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting:
Users that are interested in Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting are comparing it to the libraries listed below
- ☆26Updated 4 years ago
- The three algorithms used to solve Bayesian Stackelberg Games have been implemented here.☆26Updated 6 years ago
- ☆20Updated 5 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 5 years ago
- Solving POMDPs using exact and approximate methods☆13Updated 7 years ago
- Combining Evolutionary Algorithms and deep Reinforcement Learning☆15Updated 6 years ago
- Implementation of basic reinforcement learning algorithms (Q-learning, SARSA, Policy iteration and Value Iteration) on benchmark RL MDPs …☆38Updated 9 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 7 years ago
- Multi-objective reinforcement learning deals with finding policies for tasks where there are multiple distinct criteria to optimize for. …☆21Updated 6 years ago
- Reinforcement Learning (RL), allows you to develop smart, quick and self-learning systems in your business surroundings. It is an effecti…☆11Updated 5 years ago
- Official implementation of "Graph Meta-Reinforcement Learning for TransferableAutonomous Mobility-on-Demand"☆14Updated 3 years ago
- Attentional Mechanism incorporated in Asynchronous Advantage Actor Critic a3c/a2c deep mind☆10Updated 7 years ago
- PyTorch implementation of PPO algorithm☆21Updated 5 years ago
- TD-Regularized Actor-Critic Methods☆36Updated 5 years ago
- Deep Gaussian Process for Inverse Reinforcement Learning☆33Updated 7 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 7 years ago
- Fully Cooperative Multi-Agent Deep Reinforcement Learning☆25Updated 5 years ago
- Link to paper: https://www.ssrn.com/abstract=3804655☆12Updated 3 years ago
- ☆42Updated 3 weeks ago
- Recurrent Reinforcement Learning Algorithm Matlab Implementation☆46Updated 4 years ago
- Multi-Objective Reinforcement Learning sandbox☆10Updated 3 years ago
- Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)☆11Updated 3 years ago
- ☆10Updated 4 years ago
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning☆23Updated 6 years ago
- PyTorch implementation of various reinforcement learning algorithms☆18Updated 7 years ago
- ☆11Updated 5 years ago
- ☆16Updated 6 years ago
- Risk-Averse Distributional Reinforcement Learning: Code☆26Updated 6 years ago
- ☆36Updated 8 years ago
- Code release for AAAI 2020 paper "Smart Predict-and-Optimize for Hard Combinatorial Optimization Problems"☆38Updated 9 months ago