Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)
β281Nov 3, 2023Updated 2 years ago
Alternatives and similar repositories for openmodelz
Users that are interested in openmodelz are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ποΈ Reproducible development environment for humans and agentsβ2,187Mar 5, 2026Updated 2 weeks ago
- OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)β277Oct 11, 2023Updated 2 years ago
- A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machineβ893Mar 1, 2026Updated 3 weeks ago
- This repository contains statistics about the AI Infrastructure products.β16Feb 27, 2025Updated last year
- With Dejavu, you can have a perfect memory by capturing and organizing your visual recordings efficiently.β132Sep 1, 2023Updated 2 years ago
- This is a landscape of the infrastructure that powers the generative AI ecosystemβ154Oct 16, 2024Updated last year
- my bachelor's thesis in SJTU about https://github.com/caicloud/cycloneβ12Jan 4, 2018Updated 8 years ago
- β19Apr 11, 2024Updated last year
- Kexplain is an interactive kubectl explainβ12Oct 23, 2023Updated 2 years ago
- Docker for Your ML/DL Models Based on OCI Artifactsβ472Jan 26, 2024Updated 2 years ago
- An awesome & curated list of best LLMOps tools for developersβ5,668Feb 3, 2026Updated last month
- EvalGPT is an code interpreter framework that utilizes large language models to automate the process of code-writing and execution, delivβ¦β249Sep 17, 2023Updated 2 years ago
- Personal Blog in github.ioβ10Feb 25, 2026Updated last month
- Turn PostgreSQL into your search engine in a Pythonic way.β51Aug 29, 2025Updated 6 months ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharingβ11Apr 1, 2020Updated 5 years ago
- β145Dec 6, 2023Updated 2 years ago
- IBM Quantum Challenge Fall 2023β10May 23, 2023Updated 2 years ago
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprisesβ26Apr 24, 2025Updated 11 months ago
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clβ¦β9,664Updated this week
- An AI framework for building cool things.β211Jun 5, 2023Updated 2 years ago
- OpenAI compatible API for open source LLMsβ16Oct 30, 2023Updated 2 years ago
- RayLLM - LLMs on Ray (Archived). Read README for more info.β1,266Mar 13, 2025Updated last year
- EpochFS is a versioned cloud file system with git-like branching, transaction support.β17Mar 11, 2026Updated last week
- Scalable, Low-latency and Hybrid-enabled Vector Search in Postgres. Revolutionize Vector Search, not Database.β2,162Feb 26, 2025Updated last year
- A toolkit to run Ray applications on Kubernetesβ2,388Updated this week
- Benchmark results from code generation with LLMsβ17Sep 1, 2023Updated 2 years ago
- Run your deep learning workloads on Kubernetes more easily and efficiently.β531Mar 4, 2024Updated 2 years ago
- Machine Learning Projects with Flytekitβ36May 23, 2023Updated 2 years ago
- The inference code of RVC-Boss/GPT-SoVITS that can be developer-friendly.β16Sep 29, 2024Updated last year
- Your AI Kubernetes Expertβ186Apr 6, 2023Updated 2 years ago
- An experimental tool to modify YAMLs without losing (most of) comment lines.β16Sep 25, 2022Updated 3 years ago
- A Survey of AI startupsβ402Aug 27, 2023Updated 2 years ago
- AI-based search done rightβ20Dec 25, 2025Updated 2 months ago
- dify η₯θ―εΊζ£η΄’ε·₯ε ·β13Apr 3, 2025Updated 11 months ago
- Generic prefix tree for golangβ13Apr 25, 2025Updated 10 months ago
- A powerful prompt template engine built upon Jinjaβ12Oct 22, 2025Updated 5 months ago
- Model Deployment at Scale on Kubernetes π¦οΈβ838May 8, 2024Updated last year
- convert GitHub issues to a websiteβ28Mar 2, 2026Updated 3 weeks ago
- PostgreSQL tokenizer extension for full-text searchβ38Sep 29, 2025Updated 5 months ago