☆25Apr 29, 2023Updated 2 years ago
Alternatives and similar repositories for practical-bandits-tutorial
Users that are interested in practical-bandits-tutorial are comparing it to the libraries listed below
Sorting:
- A simple multicohort LTV calculator for subscriptions☆11Mar 7, 2023Updated 3 years ago
- A tool for detecting anomalies in time series data☆11Dec 1, 2022Updated 3 years ago
- ☆13Aug 10, 2023Updated 2 years ago
- Package for building Market Segmentation Trees, Choice Model Trees, and Isotonic Regression Trees☆17Apr 21, 2023Updated 2 years ago
- In Progress : State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch.☆19Jun 15, 2018Updated 7 years ago
- Reproducible & Collaborative Data Science, Fall 2017 - Main class website☆23Jan 1, 2018Updated 8 years ago
- A simple, continuous-control environment for OpenAI Gym☆23Jan 1, 2023Updated 3 years ago
- Personal web page.☆27Updated this week
- Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation☆694Jun 3, 2024Updated last year
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.☆25Apr 15, 2023Updated 2 years ago
- ☆10Nov 22, 2020Updated 5 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- TD-Regularized Actor-Critic Methods☆36Dec 26, 2019Updated 6 years ago
- A toolbox for simulating and estimating long-term causal effects in the presence of unobserved confounding.☆14Feb 20, 2023Updated 3 years ago
- A ProCyclingStats (PCS) data scraper. It fetches and parses HTML pages to end up building different model entities that will be serialize…☆11Updated this week
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆39Jan 22, 2021Updated 5 years ago
- ☆10Nov 21, 2022Updated 3 years ago
- Contextual Bandit Spectral Representation Learner☆12Oct 25, 2022Updated 3 years ago
- A set of protocols for remote connection, for two people to connect while apart.☆10Sep 20, 2022Updated 3 years ago
- ☆11Aug 28, 2024Updated last year
- Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".☆35May 24, 2018Updated 7 years ago
- [IJAIT 2021] MABWiser: Contextual Multi-Armed Bandits Library☆279Sep 5, 2024Updated last year
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆41Aug 27, 2022Updated 3 years ago
- Official implementation of DynE, Dynamics-aware Embeddings for RL☆44Apr 28, 2021Updated 4 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆44Jun 14, 2021Updated 4 years ago
- ☆12Apr 2, 2024Updated last year
- Layered distributions using FLAX/JAX☆10Dec 13, 2020Updated 5 years ago
- A library to create lore plots (logistic regression of the prevalence of a categorical variable in function of a continuous feature)☆18Mar 1, 2026Updated last week
- Python bindings for OptFrame C++ Functional Core☆13May 18, 2025Updated 9 months ago
- JAX implementation of GPTQ quantization algorithm☆10Jul 19, 2023Updated 2 years ago
- Gym implementation of connector to Deepmind lab☆12Mar 26, 2019Updated 6 years ago
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- Hierarchical Forecasting at Scale☆16Mar 18, 2024Updated last year
- Reinforcement Learning☆12Jun 22, 2017Updated 8 years ago
- Offline Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits☆10Oct 21, 2024Updated last year
- A lightweight, dependency-free (besides `libcurl`) command-line tool written in C to download the transcript of any YouTube video. It dir…☆21Aug 25, 2025Updated 6 months ago
- Materials for the "Recommender Systems through the lens of Decision Theory" tutorial delivered at the 30th Web Conference (WWW '21).☆11Apr 13, 2021Updated 4 years ago
- 🎮 A configurable Breakout environment for reinforcement learning☆11Mar 20, 2018Updated 7 years ago