thanhnguyentang / offline_neural_banditsLinks

An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR 2022.

☆13

Alternatives and similar repositories for offline_neural_bandits

Users that are interested in offline_neural_bandits are comparing it to the libraries listed below

Sorting:

facebookresearch / SurCo
Repo for ICML'23 paper SurCo Learning Linear Surrogates For Combinatorial Nonlinear Optimization Problems
☆18Updated 2 years ago
jparkerholder / ASEBO
Code to run the ASEBO algorithm from the paper: From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization... …
☆16Updated 4 years ago
antoinedesir / RUMnet
☆9Updated last year
fmaxgarcia / Meta-MDP
☆11Updated 5 years ago
YRussac / WeightedLinearBandits
Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"
☆17Updated 5 years ago
cornell-orie / ORSuite
☆14Updated last year
qiang-ma / HRL-for-combinatorial-optimization
Hierarchical deep reinforcement learning for combinatorial optimization problem
☆35Updated 5 years ago
krasheninnikov / max-causal-ent-irl
Maximum Causal Entropy Inverse Reinforcement Learning
☆48Updated 6 years ago
sauxpa / neural_exploration
Study NeuralUCB and regret analysis for contextual bandit with neural decision
☆96Updated 3 years ago
RingBDStack / SR-MARL
☆12Updated 2 years ago
lyeskhalil / CORL
☆25Updated 3 years ago
robintyh1 / neurips2021-meta-gradient-offpolicy-evaluation
Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021
☆12Updated 3 years ago
hsvgbkhgbv / shapley-q-learning
This repo is the implementation of paper ''SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning''.
☆47Updated last year
michaelzcjia / smart_predict_optimize
MIE424 Group Project: smart_predict_optimize
☆14Updated 4 years ago
Adaptive-RL / AdaRL-code
Implementation codes and datasets used in ICLR'22 Spotlight paper AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning…
☆38Updated last year
uclaml / NeuralUCB
☆39Updated 5 years ago
wyjung0625 / p3s
Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning
☆22Updated 5 years ago
npvoid / OnlineDoubleOracle
☆11Updated 4 years ago
chaitjo / learning-paradigms-for-tsp
Code for the paper 'On Learning Paradigms for the Travelling Salesman Problem' (NeurIPS 2019 Graph Representation Learning Workshop)
☆33Updated 4 years ago
dtak / POPCORN-POMDP
Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)
☆11Updated 4 years ago
mila-iqia / Conscious-Planning
Implementation for paper "A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning".
☆59Updated 10 months ago
bwilder0 / aaai_melding_code
Code the AAAI 2019 paper "Melding the Data-Decisions Pipeline: Decision-Focused Learning for Combinatorial Optimization"
☆34Updated 4 years ago
keep9oing / GNN_RL
reinforcement learning with pytorch geometric library
☆50Updated 3 years ago
CausalRL / DRL
Deconfounding Reinforcement Learning in Observational Settings
☆52Updated 6 years ago
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24Updated 5 years ago
XanderJC / scalable-birl
Scalable Bayesian Inverse Reinforcement Learning (ICLR 2021) by Alex J. Chan and Mihaela van der Schaar.
☆47Updated 4 years ago
011235813 / discrete_mean_field_game
Experiments on a discrete mean field game model of population dynamics with reinforcement learning
☆36Updated last year
vignesh-viswanathan / Bayesian-Stackelberg-Games
The three algorithms used to solve Bayesian Stackelberg Games have been implemented here.
☆28Updated 6 years ago
lmzintgraf / gp_pref_elicit
Ordered Preference Elicitation Strategies for Multi-Objective Decision Making using Gaussian Processes
☆23Updated 7 years ago
Sadie-Zhao / Zero-Sum-Stochastic-Stackelberg-Games-NeurIPS
This is the code repository for the paper "Zero-Sum Stochastic Stackelberg Games".
☆15Updated 2 years ago