SahanaRamnath / MultiArmedBandit_RLLinks

Implementation of various multi-armed bandits algorithms on a 10-arm testbed.

☆38

Alternatives and similar repositories for MultiArmedBandit_RL

Users that are interested in MultiArmedBandit_RL are comparing it to the libraries listed below

Sorting:

akhadangi / Multi-armed-Bandits
In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandit…
☆87Updated 4 years ago
sauxpa / neural_exploration
Study NeuralUCB and regret analysis for contextual bandit with neural decision
☆96Updated 3 years ago
alison-carrera / mabalgs
Multi-Armed Bandit Algorithms Library (MAB)
☆133Updated 2 years ago
lilianweng / multi-armed-bandit
Play with the solutions to the multi-armed-bandit problem.
☆416Updated last year
BartyzalRadek / contextual-bandits-recommender
Implementing LinUCB and HybridLinUCB in Python.
☆49Updated 7 years ago
Lunj12 / RL-Bandits-with-Knapsacks
Dynamic Pricing BwK Problem and Reinforcement Learning
☆31Updated 6 years ago
jimkon / Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym
☆177Updated 7 years ago
banditml / banditml
A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.
☆66Updated 4 years ago
sadighian / recommendation-gym
MovieLens recommendation system using reinforcement learning (GYM + PPO)
☆49Updated 5 years ago
orrivlin / MinimumVertexCover_DRL
Learning to solve Minimum Vertex Cover using Graph Convolutional Networks and RL
☆77Updated 6 years ago
appurwar / Contextual-Bandit-News-Article-Recommendation
Predict and recommend the news articles, user is most likely to click in real time.
☆32Updated 7 years ago
uclaml / NeuralUCB
☆39Updated 5 years ago
henryslzhao / RL4Recsys
paper list in the area of reinforcenment learning for recommendation systems
☆24Updated 5 years ago
iankurgarg / Reinforcement-Learning-Feature-Selection
Feature selection for maximizing expected cumulative reward
☆30Updated 7 years ago
tuomaso / radial_rl
Code used in our paper "Robust Deep Reinforment Learning through Adversarial Loss"
☆33Updated last year
Jinjiarui / rl4rs-papers
A collection of research and survey papers of reforcement learning (RL) based recommender system techniques.
☆72Updated 5 years ago
sshkhr / Practical_RL
My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow
☆54Updated 3 years ago
guyulongcs / Awesome-Deep-Reinforcement-Learning-Papers-for-Search-Recommendation-Advertising
Awesome Deep Reinforcement Learning papers for industrial Search, Recommendation and Advertising.
☆214Updated 4 years ago
xuyxu / Soft-Decision-Tree
PyTorch Implementation of "Distilling a Neural Network Into a Soft Decision Tree." Nicholas Frosst, Geoffrey Hinton., 2017.
☆102Updated last year
nikhil3456 / Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, H…
☆71Updated 5 years ago
YRussac / WeightedLinearBandits
Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"
☆17Updated 5 years ago
PacktPublishing / PyTorch-1.x-Reinforcement-Learning-Cookbook
PyTorch 1.x Reinforcement Learning Cookbook, published by Packt
☆100Updated 2 years ago
gdmarmerola / interactive-intro-rl
Big Data's open seminars: An Interactive Introduction to Reinforcement Learning
☆64Updated 4 years ago
mktal / kddcup-starting-kit
The submission template for the Learning to Dispatch and Reposition Competition @ KDD2020.
☆88Updated 4 years ago
PKU-RL / FEN
FEN Code
☆38Updated 5 years ago
collinprather / SlateQ
A comparison of Google SlateQ algorithm with traditional Reinforcement Learning algorithms
☆37Updated 2 years ago
massquantity / DBRL
Dataset Batch(offline) Reinforcement Learning for recommender system
☆153Updated 4 years ago
JianGuanTHU / IRecGAN
Implementation for our paper in NeurIPS 2019
☆48Updated 5 years ago
lnpalmer / A2C
PyTorch implementation of Advantage Actor-Critic (A2C)
☆45Updated 7 years ago
ChangyWen / wolpertinger_ddpg
Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatib…
☆65Updated 2 years ago