BATCH: Adaptive Batching for Efficient MachineLearning Serving on Serverless Platforms
☆11Aug 7, 2021Updated 4 years ago
Alternatives and similar repositories for BATCH
Users that are interested in BATCH are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Exploiting Cloud Services for Cost-Effective, SLO-Aware Machine Learning Inference Serving☆37Dec 27, 2019Updated 6 years ago
- The source code of INFless,a native serverless platform for AI inference.☆46Oct 10, 2022Updated 3 years ago
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systems☆36Nov 18, 2025Updated 4 months ago
- Distributed Deep Learning Benchmark Suite☆11Oct 31, 2022Updated 3 years ago
- Model-less Inference Serving☆94Nov 4, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- a high performance server framework☆12Dec 11, 2022Updated 3 years ago
- A simulator toolkit for Knative☆31Apr 28, 2021Updated 4 years ago
- ☆27May 31, 2023Updated 2 years ago
- ☆15Apr 11, 2024Updated 2 years ago
- A framework for trace-driven simulation of serverless Function-as-a-Service platforms☆76Jan 30, 2025Updated last year
- Network- and GPU-aware management of serverless functions at the edge☆15Mar 3, 2023Updated 3 years ago
- ☆10Oct 5, 2023Updated 2 years ago
- ☆15Nov 9, 2024Updated last year
- iGniter, an interference-aware GPU resource provisioning framework for achieving predictable performance of DNN inference in the cloud.☆39Jun 11, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A local k3s cluster with Load Balancer, Ingress Controller, and Cert Manager.☆13Jan 20, 2026Updated 2 months ago
- PyTorch Implementation of YOLOv3Tiny☆13Apr 24, 2021Updated 4 years ago
- ☆21May 13, 2022Updated 3 years ago
- Simple Github Action that prints the go version.☆15Sep 24, 2022Updated 3 years ago
- fine-tuning tutorial☆18Mar 14, 2026Updated 3 weeks ago
- Object as a Service (OaaS)☆16Dec 4, 2025Updated 4 months ago
- ☆85Feb 5, 2026Updated 2 months ago
- ☆18Apr 25, 2025Updated 11 months ago
- Fine-tuning GPT-2 to generate research paper abstracts☆12Apr 28, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Examples of inference pipelines implemented using https://github.com/SeldonIO/seldon-core☆14Feb 1, 2023Updated 3 years ago
- ☆26Feb 20, 2024Updated 2 years ago
- 9기 운영진을 위한 repo입니다.☆12Sep 22, 2024Updated last year
- ☆13Feb 2, 2021Updated 5 years ago
- ☆19Feb 28, 2022Updated 4 years ago
- ☆52Dec 13, 2022Updated 3 years ago
- AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference☆21Jan 24, 2025Updated last year
- Real-time IoT Benchmark Suite☆50Mar 25, 2018Updated 8 years ago
- ☆10Sep 13, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 票据识别,大赛地址:http://rrc.cvc.uab.es/?ch=13☆19Nov 21, 2022Updated 3 years ago
- PyTorch Implementation of CURL-Neural Network Pruning with Residual-Connections and Limited-Data☆24Jun 11, 2020Updated 5 years ago
- Amazon Elastic Inference tools and utilities.☆17Apr 8, 2020Updated 6 years ago
- Nano Banana 🍌 API MCP server☆34Nov 25, 2025Updated 4 months ago
- LLM의 다양한 튜닝 방법과 데이터 전처리 코드를 정리해놓았습니다.☆14Feb 23, 2026Updated last month
- ☆21Dec 30, 2022Updated 3 years ago
- Integration between knative and certmanager for managing TLS certs automatically.☆22Apr 26, 2024Updated last year