CSKrishna / Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting
We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting.
☆12 Updated 6 years ago
Alternatives and similar repositories for Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting:
Users who are interested in Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting are comparing it to the repositories listed below.
- The three algorithms used to solve Bayesian Stackelberg Games have been implemented here. ☆26 Updated 6 years ago
- ☆20 Updated 5 years ago
- ☆26 Updated 4 years ago
- A Multi-agent Learning Framework ☆62 Updated 3 years ago
- Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising ☆26 Updated 4 years ago
- Combining Evolutionary Algorithms and deep Reinforcement Learning ☆15 Updated 6 years ago
- Solving POMDPs using exact and approximate methods ☆13 Updated 7 years ago
- meta-MADDPG (Python implementation) ☆18 Updated 6 years ago
- Fully Cooperative Multi-Agent Deep Reinforcement Learning ☆25 Updated 5 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning ☆18 Updated 7 years ago
- qmix ☆22 Updated 4 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments" ☆17 Updated 5 years ago
- ☆10 Updated 4 years ago
- Multi-objective reinforcement learning deals with finding policies for tasks where there are multiple distinct criteria to optimize for. … ☆20 Updated 6 years ago
- Dynamic Pricing BwK Problem and Reinforcement Learning ☆29 Updated 6 years ago
- Tensorflow implementation of Deep Deterministic Policy Gradients ☆19 Updated 7 years ago
- Multiagent deep reinforcement learning research project ☆27 Updated 8 months ago
- ☆11 Updated 5 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning ☆47 Updated 6 years ago
- Deep Reinforcement Learning for Nash Equilibria ☆41 Updated 2 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R… ☆20 Updated 3 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG ☆64 Updated 5 years ago
- ☆16 Updated 6 years ago
- Implementation of basic reinforcement learning algorithms (Q-learning, SARSA, Policy iteration and Value Iteration) on benchmark RL MDPs … ☆37 Updated 8 years ago
- FEN Code ☆37 Updated 5 years ago
- A set of algorithms and environments to train SafeRL agents, written in TensorFlow2 and OpenAI Gym. ☆10 Updated 2 years ago
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning ☆23 Updated 5 years ago
- TD-Regularized Actor-Critic Methods ☆34 Updated 5 years ago
- Integration of DNN framework with Stochastic Multi-echelon Inventory Optimization (SMEIO) ☆8 Updated 3 years ago
- Some multiagent deep reinforcement learning algorithms and their PyTorch implementations. ☆11 Updated 5 years ago