Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"
☆17Nov 14, 2019Updated 6 years ago
Alternatives and similar repositories for WeightedLinearBandits
Users that are interested in WeightedLinearBandits are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Google AI Princeton control framework☆39Nov 2, 2020Updated 5 years ago
- ☆14Jun 7, 2023Updated 2 years ago
- Useful tools and practices for Python development☆18Jul 27, 2020Updated 5 years ago
- ♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation☆10Jun 20, 2021Updated 4 years ago
- simple multi-class GBDT☆15Feb 24, 2014Updated 12 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implementation of proximal policy optimization(PPO) with tensorflow☆35Feb 10, 2018Updated 8 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Aug 4, 2020Updated 5 years ago
- Twitter follower graphs of @Die_Gruenen & @AfD, including cluster and topic analysis☆10Jul 10, 2020Updated 5 years ago
- ALNS Algorithm which optimise MINLP railroad network models (applied to Madrid's network)☆13Sep 5, 2017Updated 8 years ago
- ☆11Oct 14, 2019Updated 6 years ago
- Performant, differentiable reinforcement learning☆23Jun 16, 2023Updated 2 years ago
- Companion code to CoRL 2019 paper: E Bıyık, M Palan, NC Landolfi, DP Losey, D Sadigh. "Asking Easy Questions: A User-Friendly Approach to…☆18Oct 13, 2020Updated 5 years ago
- Variational Reinforcement Learning☆17Jul 25, 2024Updated last year
- RL CIRL Research☆13Dec 8, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- Quant finance scripts☆15Apr 13, 2025Updated last year
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- Supplementary material for the paper published at ACM RecSys 2021 and its extended version accepted to ACM TORS journal☆20Jan 28, 2023Updated 3 years ago
- Dockerfile that is used for the JModelica regression testing of the Buildings library and of BuildingsPy☆16Nov 22, 2023Updated 2 years ago
- Experiments from "The Description Length of Deep Learning Models"☆10Aug 1, 2018Updated 7 years ago
- 🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorith…☆420Apr 30, 2024Updated last year
- NCSU CSC-326 Course Page☆12Dec 5, 2018Updated 7 years ago
- Active Learning with Partial Feedback, ICLR 2019☆11Apr 27, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆12Aug 30, 2021Updated 4 years ago
- ☆14Sep 7, 2024Updated last year
- ☆84Nov 19, 2020Updated 5 years ago
- Code for NeurIPS 2019 paper: "Symmetry-Based Disentangled Representation Learning requires Interaction with Environments" by H. Caselles-…☆35Dec 9, 2019Updated 6 years ago
- AGAC: Adversarially Guided Actor-Critic☆47Sep 16, 2021Updated 4 years ago
- Robustness via Retrying: Closed-Loop Robotic Manipulation with Self-Supervised Learning☆16Nov 7, 2018Updated 7 years ago
- Robust policy search algorithms which train on model ensembles☆31Oct 26, 2016Updated 9 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Jul 18, 2023Updated 2 years ago
- ROS package for robot learning☆17Oct 16, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A Chainer implementation of WGAN-GP.☆12Oct 4, 2017Updated 8 years ago
- ☆18Apr 17, 2019Updated 6 years ago
- Code for SPIBB-DQN and Soft-SPIBB-DQN☆11May 5, 2020Updated 5 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆112Dec 5, 2023Updated 2 years ago
- ☆20Sep 1, 2021Updated 4 years ago
- Tools for robustness evaluation in interpretability methods☆10Jun 25, 2021Updated 4 years ago
- Implementation of Deep Q-Network(DQN)and Model Predictive Control, and their evaluation on the Quanser robot platform☆15Jul 24, 2020Updated 5 years ago