cfoh / Multi-Armed-Bandit-Example
Learning Multi-Armed Bandits by Examples. Currently covering MAB, UCB, Boltzmann Exploration, Thompson Sampling, Contextual MAB, Deep MAB.
☆25Updated last year
Related projects
Alternatives and complementary repositories for Multi-Armed-Bandit-Example
- Dynamic Pricing BwK Problem and Reinforcement Learning☆29Updated 5 years ago
- A GitHub repository associated with the paper "Learn to Earn: Enabling Coordination Within a Ride-Hailing Fleet"☆9Updated 4 years ago
- Code for Dynamic Weights in Multi-Objective Deep Reinforcement Learning☆88Updated last year
- Illustrated Examples from Sutton and Barto☆35Updated last year
- A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.☆65Updated 3 years ago
- Efficient Ridesharing Dispatch Using Multi-Agent Reinforcement Learning☆46Updated 4 years ago
- Code used in the paper "ENERO: Efficient real-time WAN routing optimization with Deep Reinforcement Learning". In this paper, the DRL age…☆23Updated last year
- A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preferenc…☆25Updated 11 months ago
- PPO implementation of the DRL agent used in the paper "Deep Reinforcement Learning meets Graph Neural Networks: exploring a routing optim…☆79Updated 2 years ago
- Bandit algorithms simulations for online learning☆78Updated 4 years ago
- Reinforcement Learning Algorithms Based on PyTorch☆17Updated 2 years ago
- Inductive Graph Reinforcement Learning for Massive-Scale Traffic Signal Control☆56Updated 2 years ago
- Some multi-agent deep reinforcement learning algorithms and their PyTorch implementations.☆11Updated 4 years ago
- Multi-Agent Graph Convolutional Reinforcement Learning for Dynamic Electric Vehicle Charging Pricing☆47Updated last year
- qmix☆22Updated 4 years ago
- Source code of the paper "Combinatorial Optimization Meets Reinforcement Learning: Effective Taxi Order Dispatching at Large-Scale" (TKDE 2022)…☆22Updated 2 years ago
- A multi-agent reinforcement learning system using SUMO for large-scale traffic light control☆26Updated 4 years ago
- PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, H…☆66Updated 4 years ago
- Official implementation of "Graph Neural Network Reinforcement Learning for Autonomous Mobility-on-Demand☆71Updated 3 years ago
- CoLight: Learning Network-level Cooperation for Traffic Signal Control☆160Updated last year
- PyTorch implementation of multiple deep reinforcement learning algorithms (DQN, DDPG, TD3, PPO, A3C...) with OpenAI Gym☆53Updated 3 years ago
- Lexicographic Multi-Objective Reinforcement Learning☆10Updated last year
- Implementation of "Federated Control with Hierarchical Multi-Agent Deep Reinforcement Learning" (https://arxiv.org/pdf/1712.08266.pdf)☆36Updated 6 years ago
- Learning in Noisy MDP (which is governed by stochastic, exogenous input processes) with input-dependent baseline☆11Updated 4 years ago
- Implementation of the paper "A Reinforcement Learning Based Strategy for Dynamic Scheduling on Heterogeneous Platforms".☆74Updated last year
- Reinforcement learning environment for job scheduling written in Python.☆23Updated 4 years ago