Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"
☆17Nov 14, 2019Updated 6 years ago
Alternatives and similar repositories for WeightedLinearBandits
Users that are interested in WeightedLinearBandits are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Study NeuralUCB and regret analysis for contextual bandit with neural decision☆103Dec 14, 2021Updated 4 years ago
- Google AI Princeton control framework☆39Nov 2, 2020Updated 5 years ago
- ☆14Jun 7, 2023Updated 3 years ago
- Useful tools and practices for Python development☆18Jul 27, 2020Updated 5 years ago
- Hands-On Reinforcement Learning with TensorFlow & TRFL☆14Jan 18, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Aug 4, 2020Updated 5 years ago
- ☆12Mar 19, 2018Updated 8 years ago
- ☆12May 8, 2020Updated 6 years ago
- ALNS Algorithm which optimise MINLP railroad network models (applied to Madrid's network)☆13Sep 5, 2017Updated 8 years ago
- ☆11Oct 14, 2019Updated 6 years ago
- Performant, differentiable reinforcement learning☆23Jun 16, 2023Updated 3 years ago
- Companion code to CoRL 2019 paper: E Bıyık, M Palan, NC Landolfi, DP Losey, D Sadigh. "Asking Easy Questions: A User-Friendly Approach to…☆18Oct 13, 2020Updated 5 years ago
- Variational Reinforcement Learning☆17Jul 25, 2024Updated last year
- Code for 'Diff-MSR: A Diffusion Model Enhanced Paradigm for Cold-Start Multi-Scenario Recommendation' accepted to WSDM 2024☆14Aug 1, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- (WSDM'24) Cross-modal Self-Supervised Learning for Time-series through Latent Masking☆19Feb 20, 2024Updated 2 years ago
- RL CIRL Research☆13Dec 8, 2022Updated 3 years ago
- A bottom-up model for the simulation of heat demand profiles of urban areas☆13Dec 11, 2023Updated 2 years ago
- ☆10May 22, 2023Updated 3 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- PyTorch training at CSCS☆22Jul 4, 2025Updated last year
- ☆25Feb 9, 2016Updated 10 years ago
- Supplementary material for the paper published at ACM RecSys 2021 and its extended version accepted to ACM TORS journal☆20Jan 28, 2023Updated 3 years ago
- COBS: COmprehensive Building Simulator☆16Jun 23, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for demonstration example-task in RUDDER blog☆24May 19, 2020Updated 6 years ago
- Active Learning with Partial Feedback, ICLR 2019☆11Apr 27, 2020Updated 6 years ago
- ☆12Aug 30, 2021Updated 4 years ago
- ☆85Nov 19, 2020Updated 5 years ago
- Code for NeurIPS 2019 paper: "Symmetry-Based Disentangled Representation Learning requires Interaction with Environments" by H. Caselles-…☆34Dec 9, 2019Updated 6 years ago
- AGAC: Adversarially Guided Actor-Critic☆47Sep 16, 2021Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Jul 18, 2023Updated 2 years ago
- ROS package for robot learning☆17Oct 16, 2019Updated 6 years ago
- Pre-training and Transfer learning papers for recommendation☆18Mar 9, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆18Apr 17, 2019Updated 7 years ago
- Deep recommendation system☆13Dec 28, 2016Updated 9 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆116Dec 5, 2023Updated 2 years ago
- Comparison of gradient estimation techniques for black-box adversarial examples☆11Oct 31, 2018Updated 7 years ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆25Dec 29, 2023Updated 2 years ago
- Tools for robustness evaluation in interpretability methods☆10Jun 25, 2021Updated 5 years ago
- Implementation of Deep Q-Network(DQN)and Model Predictive Control, and their evaluation on the Quanser robot platform☆15Jul 24, 2020Updated 5 years ago