Interaction-side integration library for Reinforcement Learning loops: Predict, Log, [Learn,] Update
☆75Mar 3, 2026Updated last month
Alternatives and similar repositories for reinforcement_learning
Users that are interested in reinforcement_learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Contextual bandit benchmarking☆53Jan 21, 2026Updated 2 months ago
- Vowpal Wabbit examples and tutorials☆21Jan 20, 2022Updated 4 years ago
- Estimators to perform off-policy evaluation☆13Sep 3, 2024Updated last year
- Notes for the Neuroscience & AI Reading Course (SEM-I 2020-21) at BITS Pilani Goa Campus☆14Sep 30, 2020Updated 5 years ago
- Experimental new Python bindings for the VowpalWabbit library☆12Oct 5, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Active Learning with Partial Feedback, ICLR 2019☆11Apr 27, 2020Updated 5 years ago
- Repository for SAiDL Summer 2021 Induction Assignment☆21Jul 5, 2021Updated 4 years ago
- Tutorial covering Open Source tools for Source Separation.☆15Nov 12, 2021Updated 4 years ago
- Boiler plate code for Torch based ML projects☆10Jul 14, 2021Updated 4 years ago
- ☆16Jun 5, 2017Updated 8 years ago
- ☆15Jan 20, 2020Updated 6 years ago
- Reinforcement learning benchmarking.☆39Oct 22, 2018Updated 7 years ago
- Source code for our LBR paper "Closed-Form Models for Collaborative Filtering with Side-Information" published at RecSys 2020.☆15Jul 22, 2021Updated 4 years ago
- Non-stationary Off-policy Evaluation☆13Nov 8, 2018Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- This repository contains the scripts used during my participation on CIKM Cup 2016 (see http://cikmcup.org/ and https://competitions.coda…☆11Nov 4, 2016Updated 9 years ago
- Source code for our paper "Top-K Contextual Bandits with Equity of Exposure" published at RecSys 2021.☆15Aug 2, 2021Updated 4 years ago
- Off-policy Learning in Two-stage Recommender Systems. https://dl.acm.org/doi/pdf/10.1145/3366423.3380130☆30Jun 11, 2020Updated 5 years ago
- INTeractive learning via REPresentatIon Discovery☆36Jun 2, 2024Updated last year
- Code for simulations in "Computational mechanisms of curiosity and goal-directed exploration"☆11May 22, 2020Updated 5 years ago
- Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method☆32Jul 25, 2024Updated last year
- ☆18Nov 19, 2018Updated 7 years ago
- Accelerated Confergence for Counterfactual Learning to Rank☆17Jan 21, 2022Updated 4 years ago
- Reinforcement learning with a network of spiking agents☆22Jun 8, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Labs for the course on Meta Learning at BITS-Goa☆34Mar 13, 2021Updated 5 years ago
- Implementation of the X-armed Bandits algorithm, as detailed in the paper, "X-armed Bandits", Bubeck et al., 2011.☆11Jul 12, 2018Updated 7 years ago
- Context Aware Language Models☆28Jul 3, 2018Updated 7 years ago
- Python implementations of contextual bandits algorithms☆825Feb 22, 2026Updated last month
- A reader that buffers ranged calls☆12May 17, 2022Updated 3 years ago
- OCaml bindings to libuv -- Cross-platform asychronous I/O☆18Jan 12, 2015Updated 11 years ago
- support kubernetes feature for autogen(https://github.com/microsoft/autogen)☆11Sep 15, 2025Updated 7 months ago
- Birkhoff decomposition for doubly stochastic matrices.☆14Sep 17, 2023Updated 2 years ago
- Experimentation for oracle based contextual bandit algorithms.☆33Sep 12, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- The PyNN 0.8 interface to sPyNNaker.☆19Mar 15, 2022Updated 4 years ago
- Document context language models☆22Nov 13, 2015Updated 10 years ago
- Dynamic Movement Primitives in Python☆15Jul 6, 2023Updated 2 years ago
- Rectified Factor Networks☆37Oct 16, 2019Updated 6 years ago
- Code for "Revisiting classifier two-sample tests" (ICLR 2017).☆20Mar 13, 2018Updated 8 years ago
- ☆18Jul 20, 2023Updated 2 years ago
- A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementations with an aim to improve accessibilit…☆412Dec 27, 2022Updated 3 years ago