zynga / rl-bakeryView external linksLinks
RL-Bakery makes it easy to build production, large scale, batch Deep Reinforcement Learning applications.
☆96Oct 15, 2024Updated last year
Alternatives and similar repositories for rl-bakery
Users that are interested in rl-bakery are comparing it to the libraries listed below
Sorting:
- Official repository for "Investigating Pre-Training Objectives for Generalization in Visual Reinforcement Learning" (ICML 2024)☆11Sep 16, 2025Updated 4 months ago
- Made for a reading group at the Center for Safe AGI.☆12Oct 27, 2022Updated 3 years ago
- Map maker is a command line tool and library for easily generating maps from structured data.☆16Mar 5, 2024Updated last year
- Config files for setting up Multitenant Kubeflow on AWS with spot instances☆10Sep 15, 2020Updated 5 years ago
- Non-stationary Off-policy Evaluation☆13Nov 8, 2018Updated 7 years ago
- Causal Analysis of Agent Behavior for AI Safety☆19Jun 27, 2023Updated 2 years ago
- Simulation environments for Multi-Objective Reinforcement Learning (MORL)☆17Aug 2, 2022Updated 3 years ago
- Code for the paper "Batch size invariance for policy optimization"☆56Apr 2, 2023Updated 2 years ago
- Online Ranking with Multi-Armed-Bandits☆19Sep 4, 2021Updated 4 years ago
- TaskMet Task-driven Metric Learning for Model Learning☆20Feb 9, 2024Updated 2 years ago
- This workshop was done as a part of the 1729 conference organized by Fractal Analytics and Analytics Vidhya. Key content covered was hand…☆22Jul 7, 2022Updated 3 years ago
- OpenAI Gym Environment for Low-Latency Trading☆18Jun 15, 2018Updated 7 years ago
- Federated Learning Infra Architecture on Kubernetes(EKS)☆20Nov 18, 2019Updated 6 years ago
- Source code for our paper "Joint Policy-Value Learning for Recommendation" published at KDD 2020.☆23Jul 6, 2023Updated 2 years ago
- ☆21Nov 9, 2021Updated 4 years ago
- A Library for Modelling Probabilistic Hierarchical Graphical Models in PyTorch☆49Aug 7, 2020Updated 5 years ago
- Environment for OpenAI Gym which can simulate an app deployed to a cloud environment.☆16Aug 28, 2023Updated 2 years ago
- A leaderboard of human and machine performance on the Arcade Learning Environment (ALE).☆21Aug 27, 2018Updated 7 years ago
- RL agent to play μRTS with Stable-Baselines3 and PyTorch☆27Jan 23, 2022Updated 4 years ago
- Modular Multi-Objective Reinforcement Learning with Decision Values☆25Dec 8, 2022Updated 3 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms☆96Dec 8, 2020Updated 5 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆31Sep 10, 2025Updated 5 months ago
- Tutorial: Writing R and Python Packages with Multithreaded C++ Code using BLAS, AVX2/AVX512, OpenMP, C++11 Threads and Cuda GPU accelerat…☆13Nov 27, 2022Updated 3 years ago
- Verilog code for a low power RFID chip that will communicate with I2C sensors.☆13Apr 18, 2014Updated 11 years ago
- A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms☆13Dec 15, 2022Updated 3 years ago
- Physical Downlink Shared Channel (PDSCH) in 5G New Radio.☆12Jan 29, 2024Updated 2 years ago
- 基于GSConv+SlimNeck的YOLOv5的消防通道占用检测系统☆10Nov 24, 2023Updated 2 years ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆85Nov 27, 2023Updated 2 years ago
- A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)☆3,682Updated this week
- Fast Differentiable Forest lib with the advantages of both decision trees and neural networks☆78Nov 21, 2021Updated 4 years ago
- An Airflow plugin, providing an admin UI to conveniently start backfills. Usable with Airflow 1, 2 and Cloud Composer☆14Aug 16, 2022Updated 3 years ago
- Source code for the "Computationally Tractable Riemannian Manifolds for Graph Embeddings" paper☆37Jun 11, 2020Updated 5 years ago
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- SIR, SEIR, and beyond☆10Jul 6, 2023Updated 2 years ago
- SPGD: Search Party Gradient Descent algorithm, a Simple Gradient-Based Parallel Algorithm for Bound-Constrained Optimization. Link: http…☆10Oct 28, 2023Updated 2 years ago
- ☆10Apr 5, 2024Updated last year
- 基于RLCard平台的麻将mahjong博弈游戏代码,包括基于规则和基于Dueling DQN的Agent模型。☆32Apr 25, 2022Updated 3 years ago
- Code for experimenting with load-balancing intradomain traffic engineering using GNNs and RL. Project as part of masters degree at the Un…☆38Jan 12, 2021Updated 5 years ago
- ☆37Mar 31, 2020Updated 5 years ago