Hybrid Linear UCB Multi-arm Bandit library
☆14Oct 5, 2016Updated 9 years ago
Alternatives and similar repositories for hybrid-linucb
Users that are interested in hybrid-linucb are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Dynamic channel allocation in cellular networks by reinforcement learning☆18May 25, 2022Updated 3 years ago
- Contextual bandit in python☆112Jul 7, 2021Updated 4 years ago
- Thompson Sampling for Bandits using UCB policy☆10Jul 29, 2017Updated 8 years ago
- Stream Data based News Recommendation - Contextual Bandit Approach☆47Nov 15, 2017Updated 8 years ago
- C++ implementation of a b-tree.☆13Aug 4, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Bandit algorithms☆30Oct 12, 2017Updated 8 years ago
- Hybrid Linear UCB bandit learning algorithm L Li(2010) python code☆56Dec 23, 2015Updated 10 years ago
- Capacity comparison between different power allocation schemes with arbitrary input distributions and different channel gains☆10Dec 19, 2018Updated 7 years ago
- Complete Reinforcement Learning Toolkit for Large Language Models!☆21Aug 2, 2025Updated 8 months ago
- An ITP Class☆19Nov 13, 2025Updated 5 months ago
- This repository includes the source code for simulating traffic in AIMSUN with autonomous vehicles.☆11Aug 4, 2017Updated 8 years ago
- Code for abstracting, evaluating, and visualizing Markov Decision Processes.☆10Jan 12, 2017Updated 9 years ago
- AI Powered Traffic Signal Control (BTS Global Hackathon)☆15Nov 19, 2018Updated 7 years ago
- Epsilon-greedy, softmax and LinUCB contextual bandit implementations [recommender systems]☆50Mar 15, 2019Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Learning bisimulation metrics for control, particularly suited to sparse reward settings☆10Feb 28, 2023Updated 3 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- Some experimental scripts for running IQFeed on Debian GNU/Linux☆16Feb 16, 2014Updated 12 years ago
- A tool for detecting anomalies in time series data☆11Dec 1, 2022Updated 3 years ago
- 一个用于爬股票历史数据,并根据历史数据分析挖掘并对未来数据进行预测的项目☆16Oct 8, 2017Updated 8 years ago
- Library of contextual bandits algorithms☆341Mar 14, 2024Updated 2 years ago
- Contextual Bandit Algorithms (+Bandit Algorithms)☆22Oct 18, 2019Updated 6 years ago
- 股票高频数据(数据来源:新浪)☆13Jan 29, 2020Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This repository contains implementations of the paper VUSFA☆14Mar 31, 2021Updated 5 years ago
- Transformer-based Realtime User Action Model for Recommendation at Pinterest☆79Apr 13, 2023Updated 3 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- Multi-thread implementation of Piece-wise Linear Model(PLM) or Mixture of LR(MLR) with FTRL for binary-class classification problem.☆129Jun 22, 2021Updated 4 years ago
- Brian Farris' Talk on Reinforcement Learning and Multi-Armed Bandits for the Data Incubator☆30Jun 5, 2018Updated 7 years ago
- This repository is a collection of widely used self-supervised auxiliary losses used for learning representations in reinforcement learni…☆14Feb 27, 2023Updated 3 years ago
- Modelling bus-on-demand using SUMO and TraCI.☆19Apr 30, 2014Updated 11 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- Python application to setup and run streaming (contextual) bandit experiments.☆85Sep 4, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆11Jul 23, 2023Updated 2 years ago
- This contains joint channel and power allocation scheme for a full duplex cognitive radio network underlying a cellular network☆25Oct 20, 2017Updated 8 years ago
- 批量下载论文☆18Jan 12, 2019Updated 7 years ago
- The Lightspeed eCom PHP API Client☆23May 19, 2025Updated 11 months ago
- An interactive story app for Android . . .☆15Dec 14, 2014Updated 11 years ago
- Explore a bidding strategy for ad auctions☆14Nov 11, 2024Updated last year
- Predict and recommend the news articles, user is most likely to click in real time.☆32Apr 3, 2018Updated 8 years ago