kulinshah98 / Multi-Armed-Bandit-AlgorithmsLinks

Python implementation of UCB, EXP3 and Epsilon greedy algorithms

☆29

Alternatives and similar repositories for Multi-Armed-Bandit-Algorithms

Users that are interested in Multi-Armed-Bandit-Algorithms are comparing it to the libraries listed below

Sorting:

sauxpa / neural_exploration
Study NeuralUCB and regret analysis for contextual bandit with neural decision
☆99Updated 4 years ago
lilianweng / multi-armed-bandit
Play with the solutions to the multi-armed-bandit problem.
☆415Updated last year
SMPyBandits / SMPyBandits
🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorith…
☆416Updated last year
alison-carrera / mabalgs
Multi-Armed Bandit Algorithms Library (MAB)
☆133Updated 3 years ago
uclaml / NeuralUCB
☆48Updated 5 years ago
kfoofw / bandit_simulations
Bandit algorithms simulations for online learning
☆89Updated 5 years ago
HCDM / BanditLib
Library of contextual bandits algorithms
☆336Updated last year
ntucllab / striatum
Contextual bandit in python
☆112Updated 4 years ago
gdmarmerola / advanced-bandit-problems
More about the exploration-exploitation tradeoff with harder bandits
☆24Updated 6 years ago
akhadangi / Multi-armed-Bandits
In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandit…
☆89Updated 5 years ago
jimkon / Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym
☆178Updated 7 years ago
Mathtodon / Contextual_Bandits_Tree
Contains Code for Contextual Bandits Decision Tree
☆20Updated 6 years ago
YRussac / WeightedLinearBandits
Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"
☆17Updated 6 years ago
wadx2019 / Neural-Bandit
A collection of the pytorch implementation of neural bandit algorithm includes neuralUCB(Neural Contextual Bandits with UCB-based Explora…
☆25Updated 5 months ago
mynkpl1998 / Recurrent-Deep-Q-Learning
Solving POMDP using Recurrent networks
☆92Updated 5 years ago
PKU-RL / FEN
FEN Code
☆40Updated 6 years ago
tushuhei / gpucb
Simple implementation of GP-UCB algorithm.
☆54Updated 8 years ago
kazizzad / BDQN-MxNet-Gluon
Efficient Exploration through Bayesian Deep Q-Networks
☆37Updated 7 years ago
bgalbraith / bandits
Python library for Multi-Armed Bandits
☆766Updated 5 years ago
annieyan / Bandits-using-UCB-algorithm
Thompson Sampling for Bandits using UCB policy
☆10Updated 8 years ago
david-cortes / contextualbandits
Python implementations of contextual bandits algorithms
☆814Updated 6 months ago
zoulixin93 / pseudo_dyna_q
☆14Updated 5 years ago
CausalRL / DRL
Deconfounding Reinforcement Learning in Observational Settings
☆51Updated 6 years ago
ardaegeunlu / Contextual-Gaussian-Process-Bandit-Optimization
Simple implementation of the CGP-UCB algorithm.
☆38Updated 6 years ago
fiezt / ICML-2020-Implicit-Stackelberg-Learning
☆12Updated 5 years ago
hongzimao / input_driven_rl_example
Variance Reduction for Reinforcement Learning in Input-Driven Environments (ICLR '19)
☆31Updated 6 years ago
thanhnguyentang / offline_neural_bandits
An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR…
☆13Updated 3 years ago
nikhil3456 / Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, H…
☆71Updated 6 years ago
marketdesignresearch / DL-ICA
Deep-Learning-powered-Iterative-Combinatorial Auctions
☆14Updated 2 years ago
sadighian / recommendation-gym
MovieLens recommendation system using reinforcement learning (GYM + PPO)
☆50Updated 5 years ago