Implementation of various multi-armed bandit algorithms on a 10-arm testbed.
☆38Jan 16, 2020Updated 6 years ago
Alternatives and similar repositories for MultiArmedBandit_RL
Users that are interested in MultiArmedBandit_RL are comparing it to the libraries listed below.
- Play with the solutions to the multi-armed-bandit problem.☆417May 21, 2024Updated last year
- Implement different variants of gradient descent in python using numpy☆11Apr 23, 2019Updated 6 years ago
- A collection of various projects related to Reinforcement Learning☆19Feb 22, 2021Updated 5 years ago
- Thompson Sampling for Bandits using UCB policy☆10Jul 29, 2017Updated 8 years ago
- Zen-NAS, a lightning fast, training-free Neural Architecture Searching algorithm☆11Nov 12, 2021Updated 4 years ago
- StochOptim provides user friendly functions to solve optimization problems using stochastic algorithms☆10Feb 9, 2018Updated 8 years ago
- ☆10Oct 20, 2022Updated 3 years ago
- S-ROCKET: SELECTIVE RANDOM CONVOLUTION KERNELS FOR TIME SERIES CLASSIFICATION☆13Jan 3, 2023Updated 3 years ago
- Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: F…☆17May 21, 2018Updated 7 years ago
- Aggr extension for Tradingview☆18Mar 18, 2025Updated last year
- JNIEasy - Java Native Objects based on JNI☆10Aug 30, 2023Updated 2 years ago
- High performance Rust API for KDB+☆13Jun 27, 2021Updated 4 years ago
- q to Excel integration☆18Apr 16, 2018Updated 7 years ago
- KDB-Rust embedding and IPC☆10Sep 11, 2015Updated 10 years ago
- Feature selection for machine learning using mutual information.☆15Dec 4, 2024Updated last year
- ☆12Mar 14, 2023Updated 3 years ago
- This is the implementation for IEEE S&P 2022 paper "Model Orthogonalization: Class Distance Hardening in Neural Networks for Better Secur…☆11Aug 24, 2022Updated 3 years ago
- MultiscaleGraphSignalTransforms.jl is a collection of software tools written in the Julia programming language for graph signal processin…☆12Mar 15, 2026Updated 2 weeks ago
- Code for the figures in Chapter 13 of "Reinforcement Learning: An Introduction" by Sutton and Barto☆14Jul 6, 2023Updated 2 years ago
- SGQuant: Squeezing the Last Bit on Graph Neural Networks with Specialized Quantization☆11Aug 12, 2020Updated 5 years ago
- ☆10Sep 30, 2017Updated 8 years ago
- Finance Technical Indicators optimized with Numba☆11Mar 15, 2018Updated 8 years ago
- Python + Numpy + Scipy Implementation of LARS and LASSO☆12Oct 19, 2010Updated 15 years ago
- SC 2021, "LogECMem: Coupling Erasure-Coded In-Memory Key-Value Stores with Parity Logging"☆12Jul 12, 2021Updated 4 years ago
- Cluster Images using Perceptual Hash☆13Apr 22, 2016Updated 9 years ago
- ☆50Jan 30, 2026Updated 2 months ago
- ☆17Mar 28, 2024Updated 2 years ago
- A framework for a KDB back end to a Slack bot☆13Nov 11, 2022Updated 3 years ago
- Capsule Routing for Named Entity Recognition☆10Dec 22, 2020Updated 5 years ago
- Trading with ML on binance microstructure market data☆15Dec 29, 2023Updated 2 years ago
- ☆12May 27, 2022Updated 3 years ago
- ☆32Oct 21, 2025Updated 5 months ago
- Baselines of metric learning method using PyTorch.☆11Oct 18, 2021Updated 4 years ago
- On Lipschitz Regularization of Convolutional Layers using Toeplitz Matrix Theory☆10Aug 19, 2021Updated 4 years ago
- Reverse Engineering Imperceptible Backdoor Attacks on Deep Neural Networks for Detection and Training Set Cleansing☆14Feb 18, 2021Updated 5 years ago
- Code of "Visualizing and Understanding Object Detecor"☆20Jun 24, 2021Updated 4 years ago
- qzmq provides Q bindings for CZMQ, the high-level C binding for ØMQ☆22Feb 5, 2018Updated 8 years ago
- [USENIX Security 2025] SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks☆20Sep 18, 2025Updated 6 months ago
- A comparison among different variants of the gradient descent algorithm☆25Mar 31, 2017Updated 8 years ago