BATCH: Adaptive Batching for Efficient MachineLearning Serving on Serverless Platforms
☆11Aug 7, 2021Updated 4 years ago
Alternatives and similar repositories for BATCH
Users that are interested in BATCH are comparing it to the libraries listed below
Sorting:
- The source code of INFless,a native serverless platform for AI inference.☆46Oct 10, 2022Updated 3 years ago
- Exploiting Cloud Services for Cost-Effective, SLO-Aware Machine Learning Inference Serving☆37Dec 27, 2019Updated 6 years ago
- ☆15Aug 15, 2024Updated last year
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systems☆35Nov 18, 2025Updated 3 months ago
- A simulator toolkit for Knative☆31Apr 28, 2021Updated 4 years ago
- ☆27May 31, 2023Updated 2 years ago
- Model-less Inference Serving☆94Nov 4, 2023Updated 2 years ago
- ☆12Sep 25, 2019Updated 6 years ago
- A version of XRBench-MAESTRO used for MLSys 2023 publication☆27Jun 7, 2023Updated 2 years ago
- A framework for trace-driven simulation of serverless Function-as-a-Service platforms☆76Jan 30, 2025Updated last year
- Prefix-Aware Attention for LLM Decoding☆29Jan 23, 2026Updated last month
- ☆33Dec 23, 2025Updated 2 months ago
- An alternative to OpenFaaS nats-queue-worker for long-running functions☆11Dec 14, 2022Updated 3 years ago
- Nonblocking data structures☆12Jan 25, 2015Updated 11 years ago
- ☆12Jan 12, 2024Updated 2 years ago
- 🚀 Sliding Window Attention Training for Efficient Large Language Models☆16Dec 8, 2025Updated 2 months ago
- ☆11Feb 17, 2026Updated last week
- Implementation of GuP [Arai+ SIGMOD'23]☆10Jan 10, 2024Updated 2 years ago
- ☆16Apr 8, 2022Updated 3 years ago
- Network- and GPU-aware management of serverless functions at the edge☆15Mar 3, 2023Updated 2 years ago
- kerf is a tool designed to orchestrate and manage multiple kernel instances on a single host.☆25Jan 23, 2026Updated last month
- ☆11Apr 24, 2018Updated 7 years ago
- a high performance server framework☆12Dec 11, 2022Updated 3 years ago
- A coöperative multitasking framework based on `liburing` and `libucontext`☆16Jan 2, 2026Updated 2 months ago
- Query, analysis, and visualization of large video collections☆10Dec 9, 2022Updated 3 years ago
- A cross-platform library for retrieving information about connected devices.☆11Sep 5, 2023Updated 2 years ago
- Command-line utility for iteratively developing pipelines, deploying them at scale, and sharing data and derivatives☆10Jun 15, 2020Updated 5 years ago
- Providing wrapper types for safely performing panic-free checked arithmetic on instants and durations.☆17Feb 7, 2026Updated 3 weeks ago
- KPC-Toolbox: MATLAB toolbox to fit Markovian Arrival Processes☆10Jun 12, 2025Updated 8 months ago
- ☆12Apr 26, 2023Updated 2 years ago
- Go Version of Redis on PMEM☆12Dec 20, 2021Updated 4 years ago
- ☆13Aug 22, 2025Updated 6 months ago
- iGniter, an interference-aware GPU resource provisioning framework for achieving predictable performance of DNN inference in the cloud.☆39Jun 11, 2024Updated last year
- Library to support modern C development.☆11Jan 29, 2024Updated 2 years ago
- Source code for Jellyfish, a soft real-time inference serving system☆15Dec 20, 2022Updated 3 years ago
- Lithops-based Serverless implementation of the METASPACE spatial metabolomics annotation pipeline☆12Jul 6, 2023Updated 2 years ago
- Examples of inference pipelines implemented using https://github.com/SeldonIO/seldon-core☆14Feb 1, 2023Updated 3 years ago
- A definitive guide to build Tensorflow with Intel MKL support on Mac☆16Mar 28, 2018Updated 7 years ago
- Lock-free buddy allocator based on binary heap☆13Mar 3, 2025Updated 11 months ago