Fast model deployment on any cloud
★176 · Feb 25, 2024 · Updated 2 years ago
Alternatives and similar repositories for bentoctl
Users that are interested in bentoctl are comparing it to the libraries listed below.
- Model Deployment at Scale on Kubernetes ★838 · May 8, 2024 · Updated last year
- A small utility module to make it simple to build BentoML Services into images inside Kubernetes clusters. ★10 · Dec 15, 2020 · Updated 5 years ago
- BentoML Example Projects ★142 · Jan 6, 2025 · Updated last year
- Fast model deployment on Google Cloud Run ★16 · Feb 25, 2024 · Updated 2 years ago
- COVID question answering datasets and fine-tuned models ★18 · Apr 27, 2021 · Updated 4 years ago
- Fast model deployment on AWS SageMaker ★16 · Feb 25, 2024 · Updated 2 years ago
- The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more! ★8,520 · Mar 16, 2026 · Updated last week
- Fast model deployment on AWS Lambda ★14 · Feb 25, 2024 · Updated 2 years ago
- Simple dependency injection framework for Python ★21 · May 15, 2024 · Updated last year
- The simplest way to serve AI/ML models in production ★1,127 · Updated this week
- An open-source ML pipeline development platform ★998 · Jan 9, 2025 · Updated last year
- Jett is a lightweight micro-framework for building Go HTTP services. Built on top of HttpRouter, enables subrouting and flexible addition… ★180 · Mar 17, 2023 · Updated 3 years ago
- Background task queue for Python backed by Redis, a super minimal Celery ★591 · Feb 3, 2026 · Updated last month
- Repo accompanying the blog post "How to Deploy PyTorch Models with Core ML Conversion Issues" ★14 · Jul 3, 2020 · Updated 5 years ago
- Supervised instruction fine-tuning for LLMs with HF Trainer and DeepSpeed ★36 · Jul 6, 2023 · Updated 2 years ago
- A build system built for speed and power ★115 · Aug 23, 2022 · Updated 3 years ago
- A collection of resources to learn about MLOps. ★973 · Feb 26, 2023 · Updated 3 years ago
- NLP tool to extract emotional phrases from tweets 🤩 ★40 · Oct 18, 2021 · Updated 4 years ago
- Online model serving with Fraud Detection model trained with XGBoost on IEEE-CIS dataset ★18 · Jun 26, 2023 · Updated 2 years ago
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data… ★27 · Oct 20, 2022 · Updated 3 years ago
- Various test models in WNNX format. They can be viewed with `pip install wnetron && wnetron` ★12 · Jun 22, 2022 · Updated 3 years ago
- Deploy Your Own Stable Diffusion Service ★201 · Sep 29, 2024 · Updated last year
- UnionML: the easiest way to build and deploy machine learning microservices ★336 · Nov 6, 2023 · Updated 2 years ago
- A voice-enabled chatbot application built using 🦜️🔗 LangChain, text-to-speech, and speech-to-text models from 🤗 Hugging Face, and … ★194 · Nov 13, 2023 · Updated 2 years ago
- A CLI to create remote development environments in your cloud provider account in seconds ★628 · Nov 18, 2022 · Updated 3 years ago
- AI Connect for Scientific Data (AiCSD) is a new solution for using AI to connect data from scientific instruments to applicable AI pipeli… ★19 · Apr 3, 2025 · Updated 11 months ago
- An exploration comparing Flask with FastAPI. ★18 · Jun 17, 2024 · Updated last year
- ZenML: One AI Platform from Pipelines to Agents. https://zenml.io. ★5,281 · Updated this week
- Repository containing various Malayalam ASR-based resources curated from multiple sources ★18 · Oct 1, 2021 · Updated 4 years ago
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ★3,623 · May 29, 2025 · Updated 9 months ago
- An open source command line interface that runs checks on infrastructure as code to catch potential deployment issues before deploying. ★474 · Oct 18, 2023 · Updated 2 years ago
- Render Markdown to HTML on any website using a md tag ★462 · Nov 9, 2022 · Updated 3 years ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing ★11 · Apr 1, 2020 · Updated 5 years ago
- ♾️ CML - Continuous Machine Learning | CI/CD for ML ★4,170 · Jun 2, 2025 · Updated 9 months ago
- Semantic Segmentor for Protein Structures. ★11 · Dec 20, 2021 · Updated 4 years ago
- ★23 · Nov 1, 2022 · Updated 3 years ago
- Puff - The deep stack framework. ★326 · Updated this week
- BigBertha is an architecture design that demonstrates how automated LLMOps (Large Language Models Operations) can be achieved on any Kube… ★28 · Oct 27, 2023 · Updated 2 years ago
- Article about deploying machine learning models using gRPC, PyTorch, and asyncio ★30 · Nov 18, 2022 · Updated 3 years ago