RL-Bakery makes it easy to build production, large scale, batch Deep Reinforcement Learning applications.
☆97Oct 15, 2024Updated last year
Alternatives and similar repositories for rl-bakery
Users that are interested in rl-bakery are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆54Jul 28, 2019Updated 6 years ago
- Map maker is a command line tool and library for easily generating maps from structured data.☆16Mar 5, 2024Updated 2 years ago
- Multi-objective reinforcement learning for covid-19 control☆12Aug 12, 2021Updated 4 years ago
- Official repository for "Investigating Pre-Training Objectives for Generalization in Visual Reinforcement Learning" (ICML 2024)☆11Sep 16, 2025Updated 7 months ago
- Materials for the virtual NIMBLE workshop, May 26-28, 2021. For logistical information, please look below the file listing.☆14May 28, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Reinforced Recommendation toolkit built around pytorch 1.7☆587Dec 8, 2020Updated 5 years ago
- Online Ranking with Multi-Armed-Bandits☆19Sep 4, 2021Updated 4 years ago
- Cache Commander — a TUI and MCP server to explore, audit, and clean developer cache directories. Scan for CVEs, find outdated packages, r…☆64Apr 21, 2026Updated 2 weeks ago
- Made for a reading group at the Center for Safe AGI.☆12Feb 23, 2026Updated 2 months ago
- Experimentation for oracle based contextual bandit algorithms.☆33Sep 12, 2022Updated 3 years ago
- ☆22Oct 26, 2022Updated 3 years ago
- Code for the paper "Batch size invariance for policy optimization"☆60Apr 2, 2023Updated 3 years ago
- Learning bisimulation metrics for control, particularly suited to sparse reward settings☆10Feb 28, 2023Updated 3 years ago
- Causal Analysis of Agent Behavior for AI Safety☆20Jun 27, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆25Nov 1, 2022Updated 3 years ago
- A Configurable Recommender Systems Simulation Platform☆784Jan 3, 2022Updated 4 years ago
- RL agent to play μRTS with Stable-Baselines3 and PyTorch☆27Jan 23, 2022Updated 4 years ago
- A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)☆3,698Updated this week
- ☆12Apr 3, 2026Updated last month
- ☆10Nov 4, 2019Updated 6 years ago
- ☆16Nov 7, 2020Updated 5 years ago
- ☆12Mar 6, 2020Updated 6 years ago
- Official repository for "Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning".☆13Jan 25, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Machine learning evaluation database☆24Feb 7, 2018Updated 8 years ago
- ☆12Aug 15, 2020Updated 5 years ago
- This network estimation procedure combines l1-regularized logistic regression with model selection based on the Extended Bayesian Informa…☆10Oct 12, 2023Updated 2 years ago
- ☆13Apr 25, 2024Updated 2 years ago
- Tools for training pytorch language models☆27Nov 14, 2020Updated 5 years ago
- SciFin is a python package for Science & Finance.☆11Oct 25, 2020Updated 5 years ago
- Modular Multi-Objective Reinforcement Learning with Decision Values☆25Dec 8, 2022Updated 3 years ago
- Big Data's open seminars: An Interactive Introduction to Reinforcement Learning☆14Nov 21, 2017Updated 8 years ago
- ☆12Sep 30, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A repository for code of reinforcement learning algorithms with PyTorch☆30Sep 20, 2021Updated 4 years ago
- Opens a scrollable window containing a list of Scenes (assigned by the user) that allows to easily load these Scenes (single or additive)…☆12Mar 16, 2023Updated 3 years ago
- A data processing module implemented with numpy☆10Aug 16, 2022Updated 3 years ago
- Official code for "Traffic Speed Imputation with Spatio-Temporal Attentions and Cycle-Perceptual Training" (CIKM'22).☆13Mar 8, 2024Updated 2 years ago
- Reward Propagation using Graph Convolutional Networks☆13Jun 19, 2021Updated 4 years ago
- Cooperation and Fairness in Multi-Agent Reinforcement Learning☆16Aug 6, 2025Updated 9 months ago
- ☆10Sep 19, 2023Updated 2 years ago