wadx2019 / Neural-Bandit
A collection of PyTorch implementations of neural bandit algorithms, including NeuralUCB (Neural Contextual Bandits with UCB-based Exploration) and NeuralTS (Neural Thompson Sampling)
☆25 Updated 5 months ago
Alternatives and similar repositories for Neural-Bandit
Users who are interested in Neural-Bandit are comparing it to the repositories listed below
- ☆48 Updated 5 years ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decision ☆99 Updated 4 years ago
- Implementation of A Context-Integrated Transformer-Based Neural Network for Auction Design (ICML2022). ☆19 Updated 3 years ago
- Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond. ☆49 Updated 3 years ago
- OPE Tools based on Empirical Study of Off Policy Policy Estimation paper. ☆62 Updated 3 years ago
- Code for FOCAL Paper Published at ICLR 2021 ☆54 Updated 2 years ago
- Official implementation of Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning at ICML… ☆45 Updated 4 years ago
- A list of papers regarding generalization in (deep) reinforcement learning ☆153 Updated 2 years ago
- A curated list of causal reinforcement learning resources. ☆105 Updated 2 years ago
- ☆30 Updated 3 years ago
- ☆133 Updated last year
- Robust Reinforcement Learning with the Alternating Training of Learned Adversaries (ATLA) framework ☆67 Updated 4 years ago
- A collection of offline reinforcement learning algorithms. ☆207 Updated last year
- Code for conservative Q-learning ☆468 Updated 4 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21) ☆79 Updated 3 years ago
- ☆20 Updated 3 years ago
- ☆16 Updated 2 years ago
- A collection of research and survey papers of hierarchical reinforcement learning (HRL). ☆52 Updated 5 years ago
- ☆34 Updated 3 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co… ☆146 Updated last year
- Paper Collection for Batch RL with brief introductions. ☆84 Updated 3 years ago
- Official implementation of ICML'24 paper "Offline Multi-Objective Optimization". ☆24 Updated 7 months ago
- Official implementation of Neural Episodic Control with State Abstraction ☆13 Updated 2 years ago
- [NeurIPS 2020, Spotlight] State-Adversarial DQN (SA-DQN) for robust deep reinforcement learning ☆35 Updated 4 years ago
- A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges ☆258 Updated 11 months ago
- A list of Offline to Online RL papers (continually updated) ☆61 Updated last month
- Deconfounding Reinforcement Learning in Observational Settings ☆51 Updated 6 years ago
- Representation Learning for RL ☆129 Updated 2 years ago
- ☆315 Updated 3 years ago
- ☆12 Updated 5 years ago