RobRomijnders/bandit

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RobRomijnders/bandit)

RobRomijnders / bandit

Implementation of Counterfactual risk minimization

☆26

Alternatives and similar repositories for bandit

Users that are interested in bandit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hongyanz / Stackelberg-GAN
View on GitHub
Codes for Stackelberg GAN
☆15Apr 23, 2019Updated 7 years ago
aicenter / TensorCFR
View on GitHub
☆10Feb 28, 2019Updated 7 years ago
dmitryhd / lightfm
View on GitHub
A Python implementation of LightFM, a hybrid recommendation algorithm.
☆14Nov 3, 2017Updated 8 years ago
sotte / pydata_berlin_2017_active_learning
View on GitHub
☆12Jul 9, 2017Updated 9 years ago
JuliaPOMDP / TabularTDLearning.jl
View on GitHub
Julia implementations of temporal difference Reinforcement Learning algorithms like Q-Learning and SARSA
☆12Nov 16, 2025Updated 8 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
pemami4911 / sinkhorn-policy-gradient.pytorch
View on GitHub
Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"
☆41Aug 27, 2018Updated 7 years ago
rampeer / py-parallelize
View on GitHub
Parallelize your computations in parallel-apply fashion.
☆33Jul 19, 2019Updated 7 years ago
tianbingsz / SVRG
View on GitHub
Stochastic Variance Reduction Policy Gradient Estimation
☆11Nov 6, 2018Updated 7 years ago
yudasong / Reinforcement-Learning-Branch-and-Bound
View on GitHub
☆16Sep 4, 2018Updated 7 years ago
vignesh-viswanathan / Bayesian-Stackelberg-Games
View on GitHub
The three algorithms used to solve Bayesian Stackelberg Games have been implemented here.
☆28Aug 9, 2018Updated 7 years ago
crowdAI / crowdai-criteo-ad-placement-challenge-starter-kit
View on GitHub
Starter kit for getting started in the NIPS 2017 Criteo Ad Placement Challenge
☆18Nov 10, 2017Updated 8 years ago
takscape / cmecab-java
View on GitHub
A Java binding for MeCab
☆11Nov 24, 2020Updated 5 years ago
hiroki13 / neural-pasa-system
View on GitHub
☆13Apr 23, 2017Updated 9 years ago
wenty2015 / Predicting-Clinical-Events-via-Recurrent-Neural-Networks
View on GitHub
☆12Dec 19, 2016Updated 9 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
kefirski / hybrid_rvae
View on GitHub
pytorch implementation of "A Hybrid Convolutional Variational Autoencoder for Text Generation" Paper
☆36Jun 25, 2017Updated 9 years ago
annieyan / Bandits-using-UCB-algorithm
View on GitHub
Thompson Sampling for Bandits using UCB policy
☆10Jul 29, 2017Updated 9 years ago
ssokota / mmd
View on GitHub
Code for magnetic mirror descent.
☆20Oct 5, 2023Updated 2 years ago
igsor / HDPy
View on GitHub
Heuristic Dynamic Programming with Python
☆14Jul 28, 2014Updated 12 years ago
olivierjeunen / dual-bandit-kdd-2020
View on GitHub
Source code for our paper "Joint Policy-Value Learning for Recommendation" published at KDD 2020.
☆23Jul 6, 2023Updated 3 years ago
CongWeilin / cluster-loss-tensorflow
View on GitHub
This a an impletation of Deep Metric Learning via Facility Location on tensorflow
☆35Nov 27, 2017Updated 8 years ago
mxbi / ftim
View on GitHub
Feature-Time Instability Metric
☆44Jul 14, 2016Updated 10 years ago
FredericGodin / ContextualDecomposition-NLP
View on GitHub
This project contains the necessary files to reproduce the paper: "Explaining Character-Aware Neural Networks for Word-Level Prediction: …
☆12Nov 15, 2018Updated 7 years ago
rocket9-code / mlflow-deployment-controller
View on GitHub
Listens MLFlow model registry changes and deploy models based on configurations
☆20Jun 11, 2023Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
mewmew / float
View on GitHub
Binary floating-point formats in Go (IEEE 754 half and quadruple precision, x86 extended precision and PowerPC quadruple precision with d…
☆23Dec 12, 2021Updated 4 years ago
facebookresearch / rela
View on GitHub
Reinforcement Learning Assembly
☆94Sep 2, 2021Updated 4 years ago
WLDCH / covid19-deaths-prediction
View on GitHub
Predict the number of deaths due to covid19 in the next two weeks
☆11Oct 2, 2022Updated 3 years ago
ethancaballero / Skip-Thought_Memory_Networks
View on GitHub
Question Answering system based on Skip-Thought Memory Networks
☆17Mar 25, 2020Updated 6 years ago
jiagengjie / Estimating-Genetic-Parameters
View on GitHub
☆10Jun 29, 2021Updated 5 years ago
alexeygrigorev / nips-ad-placement-challenge
View on GitHub
The winning solution to the Ad Placement Challenge (NIPS'17 Causal Inference and Machine Learning Workshop)
☆38Dec 10, 2017Updated 8 years ago
cloudml / SparkTree
View on GitHub
☆14Aug 26, 2016Updated 9 years ago
Jeff-HOU / UROP-Adversarial-Feature-Matching-for-Text-Generation
View on GitHub
My first UROP project
☆23Dec 2, 2017Updated 8 years ago
zalanborsos / online-variance-reduction
View on GitHub
Online Variance Reduction
☆15May 9, 2019Updated 7 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
etali / emf
View on GitHub
Word Embedding Revisted: Explicit Matrix Factorization
☆33Sep 12, 2017Updated 8 years ago
andyliu42 / Counterfactual_Regret_Minimization_Python
View on GitHub
Counterfactual Regret Minimization (CFR) sample code in Python
☆14Apr 16, 2019Updated 7 years ago
timvieira / lazygrad
View on GitHub
Lazily regularized updates for Adagrad with sparse features. Implemented in Cython for efficiency.
☆11Jan 2, 2021Updated 5 years ago
sinagolara / VRP
View on GitHub
Adaptive Heuristic Method Based on SA and LNS for Solving Vehicle Routing Problem
☆13Oct 9, 2017Updated 8 years ago
sayalaruano / BetaLactLigPred-ML
View on GitHub
Prediction of the activity of molecules/ligands that have been tested to bind or not bind to Beta-Lactamases using machine learning cl…
☆10Mar 5, 2026Updated 4 months ago
ermongroup / best-arm-delayed
View on GitHub
Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018.
☆20Apr 3, 2018Updated 8 years ago
AntoinePassemiers / Stochastic-Unit-Commitment
View on GitHub
Stochastic Unit Commitment for Renewable Energy Supply using Lagrangian Decomposition
☆34Jun 3, 2018Updated 8 years ago