wadx2019 / Neural-Bandit
A collection of PyTorch implementations of neural bandit algorithms, including neuralUCB (Neural Contextual Bandits with UCB-based Exploration) and neuralTS (Neural Thompson Sampling)
☆25 Updated 4 months ago
Alternatives and similar repositories for Neural-Bandit
Users that are interested in Neural-Bandit are comparing it to the libraries listed below
- ☆44 Updated 5 years ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decision ☆99 Updated 3 years ago
- A list of papers regarding generalization in (deep) reinforcement learning ☆153 Updated 2 years ago
- Official implementation of Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning at ICML… ☆44 Updated 4 years ago
- ☆132 Updated last year
- Code for FOCAL Paper Published at ICLR 2021 ☆52 Updated last year
- A collection of research and survey papers of hierarchical reinforcement learning (HRL). ☆52 Updated 5 years ago
- Representation Learning for RL ☆128 Updated 2 years ago
- ☆30 Updated 3 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21) ☆79 Updated 3 years ago
- A collection of offline reinforcement learning algorithms. ☆205 Updated 11 months ago
- ☆34 Updated 3 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co… ☆141 Updated last year
- OPE Tools based on Empirical Study of Off Policy Policy Estimation paper. ☆62 Updated 3 years ago
- Implementation of A Context-Integrated Transformer-Based Neural Network for Auction Design (ICML2022). ☆19 Updated 3 years ago
- Robust Reinforcement Learning with the Alternating Training of Learned Adversaries (ATLA) framework ☆68 Updated 4 years ago
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020) ☆197 Updated 2 years ago
- ☆20 Updated 3 years ago
- discrete soft Q learning (SQL) and soft Q imitation learning (SQIL) implementation in pytorch, simple! ☆56 Updated 3 years ago
- Code for MOPO: Model-based Offline Policy Optimization ☆189 Updated 3 years ago
- [NeurIPS 2020, Spotlight] State-Adversarial DQN (SA-DQN) for robust deep reinforcement learning ☆34 Updated 4 years ago
- A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges ☆254 Updated 9 months ago
- Multi-Objective Reinforcement Learning ☆291 Updated 4 years ago
- Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan … ☆71 Updated 2 years ago
- Paper Collection for Batch RL with brief introductions. ☆84 Updated 3 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction ☆162 Updated 5 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning" ☆57 Updated 2 years ago
- A curated list of causal reinforcement learning resources. ☆105 Updated last year
- Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning' ☆71 Updated 3 years ago
- Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond. ☆47 Updated 2 years ago