sauxpa / neural_explorationLinks

Study NeuralUCB and regret analysis for contextual bandit with neural decision

☆96

Alternatives and similar repositories for neural_exploration

Users that are interested in neural_exploration are comparing it to the libraries listed below

Sorting:

uclaml / NeuralUCB
☆39Updated 5 years ago
akhadangi / Multi-armed-Bandits
In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandit…
☆87Updated 4 years ago
henryslzhao / RL4Recsys
paper list in the area of reinforcenment learning for recommendation systems
☆24Updated 5 years ago
CausalRL / DRL
Deconfounding Reinforcement Learning in Observational Settings
☆52Updated 6 years ago
fuxiAIlab / RL4RS
A Real-World Benchmark for Reinforcement Learning based Recommender System
☆229Updated last year
YRussac / WeightedLinearBandits
Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"
☆17Updated 5 years ago
saisrivatsan / deep-opt-auctions
Implementation of Optimal Auctions through Deep Learning
☆129Updated 5 years ago
wadx2019 / Neural-Bandit
A collection of the pytorch implementation of neural bandit algorithm includes neuralUCB(Neural Contextual Bandits with UCB-based Explora…
☆20Updated 3 weeks ago
sadighian / recommendation-gym
MovieLens recommendation system using reinforcement learning (GYM + PPO)
☆49Updated 5 years ago
banditml / banditml
A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.
☆66Updated 4 years ago
google-research / deep_ope
☆86Updated last year
MadryLab / implementation-matters
☆132Updated last year
dbmptr / EPOSearch
Exact Pareto Optimal solutions for preference based Multi-Objective Optimization
☆65Updated 3 years ago
amazon-science / meta-q-learning
Code for the paper "Meta-Q-Learning"( ICLR 2020)
☆103Updated 3 years ago
thanhnguyentang / offline_neural_bandits
An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR…
☆13Updated 3 years ago
guyulongcs / Awesome-Deep-Reinforcement-Learning-Papers-for-Search-Recommendation-Advertising
Awesome Deep Reinforcement Learning papers for industrial Search, Recommendation and Advertising.
☆214Updated 4 years ago
XueyingBai / Model-Based-Reinforcement-Learning-for-Online-Recommendation
A pytorch implementation of A Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation.
☆40Updated 5 years ago
andrecianflone / thompson
Thompson Sampling Tutorial
☆54Updated 6 years ago
clvoloshin / COBS
OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.
☆61Updated 2 years ago
voiler / PopulationBasedTraining
A simple PyTorch implementation of Population Based Training of Neural Networks.
☆63Updated 6 years ago
jimkon / Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym
☆177Updated 7 years ago
chaovven / maab
Code for "A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising" WSDM 2022
☆23Updated 3 years ago
mktal / kddcup-starting-kit
The submission template for the Learning to Dispatch and Reposition Competition @ KDD2020.
☆88Updated 4 years ago
jparkerholder / PB2
Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.
☆20Updated 4 years ago
qlan3 / Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
☆95Updated last month
lilianweng / multi-armed-bandit
Play with the solutions to the multi-armed-bandit problem.
☆416Updated last year
zoulixin93 / pseudo_dyna_q
☆14Updated 5 years ago
ChangyWen / wolpertinger_ddpg
Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatib…
☆65Updated 2 years ago
xinshi-chen / GenerativeAdversarialUserModel
Tensorflow implementation for "Generative Adversarial User Model forReinforcement Learning Based Recommendation System"
☆129Updated 5 years ago
wangyuhuix / TRGPPO
☆32Updated 2 years ago