Study NeuralUCB and regret analysis for contextual bandit with neural decision
☆99Dec 14, 2021Updated 4 years ago
Alternatives and similar repositories for neural_exploration
Users that are interested in neural_exploration are comparing it to the libraries listed below
Sorting:
- A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.☆71Jun 4, 2021Updated 4 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Nov 14, 2019Updated 6 years ago
- ☆49Jul 4, 2020Updated 5 years ago
- [AAAI 2021 Workshop] The official repository for the LST-MAP model for few-shot image classification.☆13Feb 12, 2021Updated 5 years ago
- ☆11Aug 10, 2020Updated 5 years ago
- Python implementations of contextual bandits algorithms☆823Feb 22, 2026Updated 2 weeks ago
- Quant finance scripts☆15Apr 13, 2025Updated 10 months ago
- Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22).☆29Oct 25, 2022Updated 3 years ago
- Multi Armed Bandits implementation using the Yahoo! Front Page Today Module User Click Log Dataset☆99Oct 21, 2021Updated 4 years ago
- ☆30Jan 17, 2022Updated 4 years ago
- PyTorch port and extension of the Deep Bayesian Bandits Library☆43Sep 4, 2019Updated 6 years ago
- Repository for ML Reproducibility Challenge 2020 for the Neurips paper, "The Value Equivalence Principle for Model-Based Reinforcement Le…☆18Apr 13, 2021Updated 4 years ago
- An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR…☆13Mar 13, 2022Updated 3 years ago
- Predict and recommend the news articles, user is most likely to click in real time.☆32Apr 3, 2018Updated 7 years ago
- Bayesian active RL (BARL) and trajectory information planning (TIP)☆26Oct 11, 2022Updated 3 years ago
- Solving the Travelling Salesman Problem, with applying the hard constraints using the QAutoencoder☆11May 11, 2022Updated 3 years ago
- A tool to understand how buildings perform in terms of occupant comfort. https://lmnarchitects.com/tech-studio/☆10Oct 7, 2019Updated 6 years ago
- PyTorch implementation of the ICML 2020 paper "Latent Bernoulli Autoencoder"☆25Apr 8, 2021Updated 4 years ago
- ☆106Sep 13, 2021Updated 4 years ago
- PSYCH 291: Causal Cognition (https://tobiasgerstenberg.github.io/causal_cognition/)☆12May 23, 2019Updated 6 years ago
- The repo for Shen Group's FMAB repo☆11Jan 21, 2021Updated 5 years ago
- This repository contains the code of the paper Equivariant Q Learning in Spatial Action Spaces☆11Nov 4, 2021Updated 4 years ago
- The code for the paper "Spatio-Temporal Structured Sparse Regression With Hierarchical Gaussian Process Priors" by Danil Kuzin, Olga Isup…☆13Nov 19, 2018Updated 7 years ago
- Neural Fixed-Point Acceleration for Convex Optimization☆29Oct 6, 2022Updated 3 years ago
- The implementation of "The Kanerva Machine" with Pytorch and Pyro☆12Jun 14, 2018Updated 7 years ago
- ☆29May 27, 2024Updated last year
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- Multi-Armed Bandit Algorithms Library (MAB)☆135Sep 6, 2022Updated 3 years ago
- Minimal A2C/A3C example of an LSTM-based meta-learner.☆13Feb 2, 2021Updated 5 years ago
- Implementing Visual Saliency Models☆13Jan 10, 2018Updated 8 years ago
- ☆14Oct 7, 2022Updated 3 years ago
- Yahoo! news article recommendation system by linUCB☆112Feb 1, 2018Updated 8 years ago
- All things done for IIIT research.☆11Oct 28, 2020Updated 5 years ago
- Tutorial covering Open Source tools for Source Separation.☆15Nov 12, 2021Updated 4 years ago
- RL CIRL Research☆13Dec 8, 2022Updated 3 years ago
- This repository contains codes for paper: Generalized Linear Bandits with Local Differential Privacy by Yuxuan Han, Zhipeng Liang, Yang W…☆16Oct 26, 2021Updated 4 years ago
- The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as th…☆12Jul 14, 2021Updated 4 years ago
- Baselines and memory-based scenarios for the ViZDoom simulator☆36Dec 8, 2022Updated 3 years ago
- [IJAIT 2021] MABWiser: Contextual Multi-Armed Bandits Library☆279Sep 5, 2024Updated last year