Python implementation of UCB, EXP3 and Epsilon greedy algorithms
☆31Oct 4, 2018Updated 7 years ago
Alternatives and similar repositories for Multi-Armed-Bandit-Algorithms
Users that are interested in Multi-Armed-Bandit-Algorithms are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The model recommends a set of books to user based on Machine Learning Techniques☆13Aug 17, 2019Updated 6 years ago
- This repository contains the code for paper Li, Ran, et al. "Decision-oriented learning for future power system decision-making under unc…☆32Apr 13, 2025Updated last year
- The code for Isolation Mondrian (iMondrian) forest for batch and online anomaly detection☆10Feb 24, 2021Updated 5 years ago
- ☆15Nov 4, 2021Updated 4 years ago
- [ICANN 2022] ''An Improved Lightweight YOLOv5 Model Based on Attention Mechanism for Face Mask Detection'' Official Code☆10Feb 27, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: F…☆17May 21, 2018Updated 7 years ago
- Chain-of-thought 방식을 활용하여 llama2를 fine-tuning☆10Nov 18, 2023Updated 2 years ago
- This repo contains the code to reproduce the paper: "Enriched Music Representations with Multiple Cross-modal Contrastive Learning"☆15Jun 22, 2023Updated 2 years ago
- Experiments of the DAI in Healthcare project - skin lesions images use case - using Flower☆12Jun 16, 2022Updated 3 years ago
- Temporal IMLinUCB - a solution for Online Influence Maximization problem in Temporal Networks (based on IMLinUCB)☆17May 3, 2024Updated last year
- Official write-up for Speed Hack event at POC2017☆15Nov 11, 2017Updated 8 years ago
- ☆14Sep 19, 2023Updated 2 years ago
- [ICLR 2025] "Understanding Constraint Inference in Safety-Critical Inverse Reinforcement Learning"☆16Nov 30, 2025Updated 5 months ago
- Mon(IoT)r Lab Testbed Software - Core Component☆13Aug 9, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆13Dec 22, 2024Updated last year
- 《自然语言处理——基于预训练模型的方法》全书代码实现☆12Jan 16, 2023Updated 3 years ago
- [NeurIPS'25] The official code of "PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning"☆31Mar 30, 2026Updated last month
- Misc resources for my daily pentesting...☆19Mar 26, 2025Updated last year
- Source code to accompany research paper on training multi token prediction language models using self-distillation.☆35Feb 21, 2026Updated 2 months ago
- Runtime verification tool for Solidity smart contracts.☆34Mar 29, 2023Updated 3 years ago
- brute but stronger☆11Aug 4, 2022Updated 3 years ago
- Examples in "Rust Essentials" Book☆10Jun 17, 2016Updated 9 years ago
- ☆56Aug 26, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆16Dec 24, 2021Updated 4 years ago
- ☆15Mar 2, 2025Updated last year
- Set up files for the OctaPi☆24Sep 9, 2019Updated 6 years ago
- A Compiler made with python using some useful libreries.☆11Nov 11, 2019Updated 6 years ago
- A V8 Sandbox Escape Technique.☆21Feb 8, 2025Updated last year
- Gather research papers and corresponding codes about Generalizable Robot Manipulation for Embodied Intelligence.☆32Mar 16, 2026Updated last month
- A framework for QuickTune☆16Jan 9, 2025Updated last year
- ☆16Mar 9, 2019Updated 7 years ago
- ☆18Nov 22, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Implementation of Hippoformer, Integrating Hippocampus-inspired Spatial Memory with Transformers☆50Feb 5, 2026Updated 2 months ago
- Python JIT transpiler to C++☆15Jan 14, 2020Updated 6 years ago
- [NeurIPS 2022] Official codebase for "Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learn…☆26Feb 15, 2025Updated last year
- ☆21Jan 21, 2026Updated 3 months ago
- A collection of resources and information about CVE-2023-2033☆19Aug 13, 2023Updated 2 years ago
- Magicwand tool to generate tcp traffic data☆24May 26, 2021Updated 4 years ago
- Hinton's Forward-Forward Algorithm for Deep Learning☆10Feb 6, 2023Updated 3 years ago