cfoh / Multi-Armed-Bandit-Example

Learning Multi-Armed Bandits by Examples. Currently covering MAB, UCB, Boltzmann Exploration, Thompson Sampling, Contextual MAB, Deep MAB.

☆25

Related projects

Alternatives and complementary repositories for Multi-Armed-Bandit-Example

Lunj12 / RL-Bandits-with-Knapsacks
Dynamic Pricing BwK Problem and Reinforcement Learning
☆29 Updated 5 years ago
transparent-framework / optimize-ride-sharing-earnings
A GitHub repository associated with the paper "Learn to Earn: Enabling Coordination Within a Ride-Hailing Fleet"
☆9 Updated 4 years ago
axelabels / DynMORL
Code for Dynamic Weights in Multi-Objective Deep Reinforcement Learning
☆88 Updated last year
laxatives / rl
Illustrated Examples from Sutton and Barto
☆35 Updated last year
banditml / banditml
A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.
☆65 Updated 3 years ago
UMich-ML-Group / RL-Ridesharing
Efficient Ridesharing Dispatch Using Multi-Agent Reinforcement Learning
☆46 Updated 4 years ago
BNN-UPC / ENERO
Code used in the paper "ENERO: Efficient real-time WAN routing optimization with Deep Reinforcement Learning". In this paper, the DRL age…
☆23 Updated last year
tbasaklar / PDMORL-Preference-Driven-Multi-Objective-Reinforcement-Learning-Algorithm
A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preferenc…
☆25 Updated 11 months ago
paulalmasan / DRL-GNN-PPO
PPO implementation of the DRL agent used in the paper "Deep Reinforcement Learning meets Graph Neural Networks: exploring a routing optim…
☆79 Updated 2 years ago
kfoofw / bandit_simulations
Bandit algorithms simulations for online learning
☆78 Updated 4 years ago
laohuu / reinforcement_learning
Reinforcement Learning Algorithms Based on PyTorch
☆17 Updated 2 years ago
netx-repo / neuroplan
☆87 Updated last year
venktesh22 / ExpressLanes_Deep-RL
☆26 Updated 4 years ago
willzhang3 / MASTER-electric_vehicle_charging_recommendation
☆49 Updated 2 years ago
Metro1998 / hppo-in-traffic-signal-control
☆43 Updated 6 months ago
FXDevailly / IG-RL
Inductive Graph Reinforcement Learning for Massive-Scale Traffic Signal Control
☆56 Updated 2 years ago
Steven-Ho / madrl-baselines
Some multi-agent deep reinforcement learning algorithms and their PyTorch implementations.
☆11 Updated 4 years ago
willzhang3 / MAGC-dynamic_electric_vehicle_charging_pricing
Multi-Agent Graph Convolutional Reinforcement Learning for Dynamic Electric Vehicle Charging Pricing
☆47 Updated last year
verystrongjoe / qmix
qmix
☆22 Updated 4 years ago
dingyuan-shi / Learning-To-Dispatch
Source code of paper Combinatorial Optimization Meets Reinforcement Learning: Effective Taxi Order Dispatching at Large-Scale (TKDE 2022)…
☆22 Updated 2 years ago
LMozart / sumo-multiagent
A multi-agent reinforcement learning system using SUMO for large-scale traffic light control
☆26 Updated 4 years ago
nikhil3456 / Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, H…
☆66 Updated 4 years ago
DanieleGammelli / gnn-rl-for-amod
Official implementation of "Graph Neural Network Reinforcement Learning for Autonomous Mobility-on-Demand"
☆71 Updated 3 years ago
wingsweihua / colight
CoLight: Learning Network-level Cooperation for Traffic Signal Control
☆160 Updated last year
Quantum-Cheese / DeepReinforcementLearning_Pytorch
PyTorch implementations of multiple deep reinforcement learning algorithms (DQN, DDPG, TD3, PPO, A3C...) with OpenAI Gym
☆53 Updated 3 years ago
lrhammond / lmorl
Lexicographic Multi-Objective Reinforcement Learning
☆10 Updated last year
skumar9876 / FCRL
Implementation of "Federated Control with Hierarchical Multi-Agent Deep Reinforcement Learning" (https://arxiv.org/pdf/1712.08266.pdf)
☆36 Updated 6 years ago
lehduong / Job-Scheduling-with-Reinforcement-Learning
Learning in a noisy MDP (one governed by stochastic, exogenous input processes) with an input-dependent baseline
☆11 Updated 4 years ago
nathangrinsztajn / RL_for_dynamic_scheduling
Implementation of the paper "A Reinforcement Learning Based Strategy for Dynamic Scheduling on Heterogeneous Platforms".
☆74 Updated last year
vaibkumr / JobSchedulingRLenv
Reinforcement learning environment for job scheduling, written in Python.
☆23 Updated 4 years ago