Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"
☆17Nov 14, 2019Updated 6 years ago
Alternatives and similar repositories for WeightedLinearBandits
Users that are interested in WeightedLinearBandits are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Study NeuralUCB and regret analysis for contextual bandit with neural decision☆102Dec 14, 2021Updated 4 years ago
- Google AI Princeton control framework☆39Nov 2, 2020Updated 5 years ago
- ☆14Jun 7, 2023Updated 2 years ago
- Hands-On Reinforcement Learning with TensorFlow & TRFL☆14Jan 18, 2021Updated 5 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Aug 4, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆12May 8, 2020Updated 5 years ago
- Source code for EMSE 2023 paper "Zero-Shot Code Representation Learning via Prompt Tuning"☆13Feb 15, 2023Updated 3 years ago
- Some of Our Audit Reports, Presentations, etc☆13Mar 26, 2024Updated 2 years ago
- ☆11Oct 14, 2019Updated 6 years ago
- Performant, differentiable reinforcement learning☆23Jun 16, 2023Updated 2 years ago
- Companion code to CoRL 2019 paper: E Bıyık, M Palan, NC Landolfi, DP Losey, D Sadigh. "Asking Easy Questions: A User-Friendly Approach to…☆18Oct 13, 2020Updated 5 years ago
- Variational Reinforcement Learning☆17Jul 25, 2024Updated last year
- Code for 'Diff-MSR: A Diffusion Model Enhanced Paradigm for Cold-Start Multi-Scenario Recommendation' accepted to WSDM 2024☆13Aug 1, 2025Updated 9 months ago
- RL CIRL Research☆13Dec 8, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A bottom-up model for the simulation of heat demand profiles of urban areas☆13Dec 11, 2023Updated 2 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- ☆10May 22, 2023Updated 2 years ago
- This project enables hyperledger fabric to evolve the iov energy trading☆10Apr 29, 2022Updated 4 years ago
- Supplementary material for the paper published at ACM RecSys 2021 and its extended version accepted to ACM TORS journal☆20Jan 28, 2023Updated 3 years ago
- ☆25Feb 9, 2016Updated 10 years ago
- Code for demonstration example-task in RUDDER blog☆24May 19, 2020Updated 5 years ago
- Dockerfile that is used for the JModelica regression testing of the Buildings library and of BuildingsPy☆16Nov 22, 2023Updated 2 years ago
- A RAG system is just the beginning of harnessing the power of LLM. The next step is creating an intelligent Agent. In Agentic RAG the Ag…☆14May 31, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Experiments from "The Description Length of Deep Learning Models"☆10Aug 1, 2018Updated 7 years ago
- 2013 Fall Cloud Computing Project for Nerve Cloud group: MapReduce-Based Deep Learning☆15Dec 2, 2013Updated 12 years ago
- 🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorith…☆422Apr 30, 2024Updated 2 years ago
- ☆11Dec 26, 2022Updated 3 years ago
- Welcome to FLSim_V2, a PyTorch based federated Reinforcement learning simulation framework☆10Dec 15, 2022Updated 3 years ago
- Active Learning with Partial Feedback, ICLR 2019☆11Apr 27, 2020Updated 6 years ago
- ☆12Aug 30, 2021Updated 4 years ago
- Code repository for technical papers about selfish mining analysis.☆13May 16, 2023Updated 2 years ago
- ☆85Nov 19, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- AGAC: Adversarially Guided Actor-Critic☆47Sep 16, 2021Updated 4 years ago
- Robustness via Retrying: Closed-Loop Robotic Manipulation with Self-Supervised Learning☆16Nov 7, 2018Updated 7 years ago
- Code for the paper "Addressing Model Vulnerability to Distributional Shifts over Image Transformation Sets", ICCV 2019☆27Mar 17, 2020Updated 6 years ago
- Robust policy search algorithms which train on model ensembles☆31Oct 26, 2016Updated 9 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Jul 18, 2023Updated 2 years ago
- ROS package for robot learning☆17Oct 16, 2019Updated 6 years ago
- ☆18Apr 17, 2019Updated 7 years ago