CSKrishna / Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting
We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting
☆11Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting
- The three algorithms used to solve Bayesian Stackelberg Games have been implemented here.☆25Updated 6 years ago
- ☆20Updated 5 years ago
- Multi-objective reinforcement learning deals with finding policies for tasks where there are multiple distinct criteria to optimize for. …☆20Updated 5 years ago
- ☆26Updated 4 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 5 years ago
- Fully Cooperative Multi-Agent Deep Reinforcement Learning☆24Updated 5 years ago
- qmix☆22Updated 4 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning☆47Updated 5 years ago
- Link to paper: https://www.ssrn.com/abstract=3804655☆13Updated 3 years ago
- Dynamic Pricing BwK Problem and Reinforcement Learning☆29Updated 5 years ago
- PyTorch implementation of MATD3☆12Updated 4 years ago
- Combining Evolutionary Algorithms and deep Reinforcement Learning☆14Updated 6 years ago
- A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preferenc…☆25Updated last year
- Experimenting with meta-learning approaches to opponent modelling in MARL. Building upon previous public implementations of MADDPG and M3…☆15Updated 2 years ago
- Multiagent deep reinforcement learning research project☆27Updated 5 months ago
- ☆16Updated 6 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆63Updated 5 years ago
- ☆10Updated 2 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 7 years ago
- Implementation code for GraphMIX: Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning☆30Updated 3 years ago
- A Multi-agent Learning Framework☆62Updated 3 years ago
- Solving POMDPs using exact and approximate methods☆13Updated 7 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆28Updated 5 years ago
- Implementation of basic reinforcement learning algorithms (Q-learning, SARSA, Policy iteration and Value Iteration) on benchmark RL MDPs …☆36Updated 8 years ago
- Experiments on a discrete mean field game model of population dynamics with reinforcement learning☆31Updated last year
- Integration of DNN framework with Stochastic Multi-echelon Inventory Optimization (SMEIO)☆8Updated 3 years ago
- Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning☆55Updated 2 years ago
- Implement Google Deep Minds DQN for multiple agents for a grid world environment where vehicles must pick up customers.☆27Updated 6 years ago
- ☆26Updated 6 years ago
- Decision Transformer: A brand new Offline RL Pattern.☆34Updated 2 years ago