banditml / offline-policy-evaluation
Implementations and examples of common offline policy evaluation methods in Python.
☆224 Updated 2 years ago
Alternatives and similar repositories for offline-policy-evaluation
Users who are interested in offline-policy-evaluation are comparing it to the libraries listed below.
- RL-Bakery makes it easy to build production, large scale, batch Deep Reinforcement Learning applications. ☆95 Updated last year
- A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services. ☆69 Updated 4 years ago
- [IJAIT 2021] MABWiser: Contextual Multi-Armed Bandits Library ☆267 Updated last year
- ☆106 Updated 4 years ago
- Contextual bandit benchmarking ☆52 Updated 5 months ago
- ☆32 Updated 9 months ago
- A Python sandbox for decision making in dynamics ☆422 Updated 2 years ago
- ☆50 Updated last year
- ☆51 Updated 4 years ago
- Python implementations of contextual bandits algorithms ☆811 Updated 5 months ago
- RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems ☆124 Updated 3 years ago
- AuctionGym is a simulation environment that enables reproducible evaluation of bandit and reinforcement learning methods for online adver… ☆181 Updated this week
- Contextual bandit in python ☆112 Updated 4 years ago
- ☆316 Updated 2 years ago
- Multi-Armed Bandit Algorithms Library (MAB) ☆133 Updated 3 years ago
- Online Ranking with Multi-Armed-Bandits ☆19 Updated 4 years ago
- ☆44 Updated 3 years ago
- 🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorith… ☆415 Updated last year
- PyTorch port and extension of the Deep Bayesian Bandits Library ☆43 Updated 6 years ago
- Simple but Flexible Recommendation Engine in PyTorch ☆133 Updated 3 years ago
- ☆25 Updated 2 years ago
- Big Data's open seminars: An Interactive Introduction to Reinforcement Learning ☆63 Updated 4 years ago
- UpliftML: A Python Package for Scalable Uplift Modeling ☆327 Updated 2 years ago
- Implementation of statistical models to analyze time lagged conversions ☆263 Updated last year
- Library of contextual bandits algorithms ☆335 Updated last year
- working example of a contextual multi-armed bandit ☆55 Updated 6 years ago
- Multi-Armed Bandit algorithms applied to the MovieLens 20M dataset ☆57 Updated 5 years ago
- Code for reco-gym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising ☆480 Updated 4 years ago
- Spark implementation of computing Shapley Values using monte-carlo approximation ☆78 Updated 2 years ago
- [AAAI 2024] Mab2Rec: Multi-Armed Bandits Recommender ☆156 Updated last year