Hybrid Linear UCB Multi-arm Bandit library
☆14 · Oct 5, 2016 · Updated 9 years ago
Alternatives and similar repositories for hybrid-linucb
Users who are interested in hybrid-linucb are comparing it to the libraries listed below.
- Dynamic channel allocation in cellular networks by reinforcement learning ☆18 · May 25, 2022 · Updated 4 years ago
- Thompson Sampling for Bandits using UCB policy ☆10 · Jul 29, 2017 · Updated 8 years ago
- Hybrid Linear UCB bandit learning algorithm L Li(2010) python code ☆56 · Dec 23, 2015 · Updated 10 years ago
- Repository for Conflict Urbanism: Aleppo, Center for Spatial Research, Columbia University ☆13 · Mar 20, 2016 · Updated 10 years ago
- This repository includes the source code for simulating traffic in AIMSUN with autonomous vehicles. ☆11 · Aug 4, 2017 · Updated 8 years ago
- ☆12 · Nov 9, 2018 · Updated 7 years ago
- Cyclopath is an online bicycle map and trip planner for all types of cyclists. It's also an inventory management and analytics engine for… ☆18 · Jul 2, 2020 · Updated 5 years ago
- Code for abstracting, evaluating, and visualizing Markov Decision Processes. ☆10 · Jan 12, 2017 · Updated 9 years ago
- ☆12 · Mar 23, 2018 · Updated 8 years ago
- Learning bisimulation metrics for control, particularly suited to sparse reward settings ☆11 · Feb 28, 2023 · Updated 3 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019. ☆10 · Jan 10, 2019 · Updated 7 years ago
- A Telegram bot that you can log to from Python and manage long running processes. ☆27 · Dec 8, 2022 · Updated 3 years ago
- Some experimental scripts for running IQFeed on Debian GNU/Linux ☆16 · Feb 16, 2014 · Updated 12 years ago
- ☆15 · Feb 25, 2018 · Updated 8 years ago
- ☆22 · Oct 23, 2015 · Updated 10 years ago
- High-frequency stock data (data source: Sina) ☆13 · Jan 29, 2020 · Updated 6 years ago
- This repository contains implementations of the paper VUSFA ☆14 · Mar 31, 2021 · Updated 5 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid ☆12 · Jun 19, 2019 · Updated 7 years ago
- Brian Farris' Talk on Reinforcement Learning and Multi-Armed Bandits for the Data Incubator ☆30 · Jun 5, 2018 · Updated 8 years ago
- This repository is a collection of widely used self-supervised auxiliary losses used for learning representations in reinforcement learni… ☆14 · Feb 27, 2023 · Updated 3 years ago
- papers about reinforcement learning ☆13 · Jan 4, 2021 · Updated 5 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning" ☆20 · Oct 6, 2021 · Updated 4 years ago
- Python application to setup and run streaming (contextual) bandit experiments. ☆85 · Sep 4, 2025 · Updated 9 months ago
- The implementation of Discriminator Soft Actor Critic ☆15 · Jan 25, 2020 · Updated 6 years ago
- ☆11 · Jul 23, 2023 · Updated 2 years ago
- ☆21 · Sep 17, 2025 · Updated 9 months ago
- ☆10 · Nov 21, 2022 · Updated 3 years ago
- An interactive story app for Android . . . ☆15 · Dec 14, 2014 · Updated 11 years ago
- Prediction of box office success using Google Trends data ☆11 · Dec 5, 2019 · Updated 6 years ago
- Data for the paper "A Dataset for Learning University STEM Courses at Scale" by Zhang et al., 2022. ☆15 · Nov 22, 2022 · Updated 3 years ago
- Collection of (unfinished) notebooks ☆14 · Sep 16, 2020 · Updated 5 years ago
- Paper notes for my PhD on Machine Learning (mostly focused on Reinforcement Learning) ☆17 · Jul 22, 2019 · Updated 6 years ago
- ☆25 · May 9, 2021 · Updated 5 years ago
- Files from the published Alpha Star paper by DeepMind ☆18 · Nov 14, 2019 · Updated 6 years ago
- Codes for 'Deep Deterministic Information Bottleneck with Matrix-based entropy functional' in ICASSP 2021 ☆13 · Jul 27, 2022 · Updated 3 years ago
- This repository implements Pozzolo, et al., (2015)'s probability calibration for imbalanced data. ☆13 · Dec 3, 2024 · Updated last year
- ☆12 · Dec 21, 2024 · Updated last year
- A collection of functions for use in Excel Power Query ☆31 · Sep 21, 2017 · Updated 8 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024 ☆24 · Apr 8, 2024 · Updated 2 years ago