wadx2019 / Neural-BanditLinks
A collection of the pytorch implementation of neural bandit algorithm includes neuralUCB(Neural Contextual Bandits with UCB-based Exploration) and neuralTS(Neural Thompson sampling)
☆20Updated 3 weeks ago
Alternatives and similar repositories for Neural-Bandit
Users that are interested in Neural-Bandit are comparing it to the libraries listed below
Sorting:
- ☆39Updated 5 years ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decision☆96Updated 3 years ago
- A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges☆248Updated 6 months ago
- Implementation of A Context-Integrated Transformer-Based Neural Network for Auction Design (ICML2022).☆18Updated 3 years ago
- Code of NeurIPS paper: arxiv.org/abs/2302.08224☆214Updated 11 months ago
- Official implementation of Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning at ICML…☆40Updated 4 years ago
- A curated list of causal reinforcement learning resources.☆96Updated last year
- An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR…☆13Updated 3 years ago
- Multi-Objective Reinforcement Learning☆282Updated 4 years ago
- Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond.☆47Updated 2 years ago
- Paper list for constrained policy optimization in reinforcement learning.☆73Updated last year
- Code implementation for NeurIPS 2019 submission 'Reinforcement Learning for Integer Programming: Learning to Cut'☆40Updated 6 years ago
- A collection of offline reinforcement learning algorithms.☆194Updated 8 months ago
- Code for FOCAL Paper Published at ICLR 2021☆51Updated last year
- Learning to branch with reinforcement learning using retrospective trajectories for exact combinatorial optimisation.☆34Updated 2 years ago
- Exact Pareto Optimal solutions for preference based Multi-Objective Optimization☆65Updated 3 years ago
- Deep-Learning-powered-Iterative-Combinatorial Auctions☆14Updated 2 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆139Updated last year
- Robust Reinforcement Learning with the Alternating Training of Learned Adversaries (ATLA) framework☆68Updated 4 years ago
- ☆132Updated last year
- OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.☆61Updated 3 years ago
- Implementation of the VIPER algorithm introduced in "Verifiable Reinforcement Learning via Policy Extraction" by Bastani et al.☆15Updated last year
- Representation Learning for RL☆126Updated 2 years ago
- ☆20Updated 3 years ago
- Reimplementation of "Exact Combinatorial Optimization with Graph Convolutional Neural Networks" (NeurIPS 2019)☆42Updated 11 months ago
- ☆25Updated 3 years ago
- ☆36Updated last year
- [NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"☆135Updated 3 years ago
- Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatib…☆65Updated 2 years ago
- ☆12Updated 5 years ago