Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)
☆281Nov 3, 2023Updated 2 years ago
Alternatives and similar repositories for openmodelz
Users that are interested in openmodelz are comparing it to the libraries listed below
Sorting:
- 🏕️ Reproducible development environment for humans and agents☆2,184Updated this week
- A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine☆892Updated this week
- OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)☆276Oct 11, 2023Updated 2 years ago
- ☆19Apr 11, 2024Updated last year
- OpenDAL fsspec integration☆34Jan 20, 2026Updated last month
- With Dejavu, you can have a perfect memory by capturing and organizing your visual recordings efficiently.☆132Sep 1, 2023Updated 2 years ago
- This is a landscape of the infrastructure that powers the generative AI ecosystem☆154Oct 16, 2024Updated last year
- This repository contains statistics about the AI Infrastructure products.☆17Feb 27, 2025Updated last year
- Turn PostgreSQL into your search engine in a Pythonic way.☆51Aug 29, 2025Updated 6 months ago
- Kexplain is an interactive kubectl explain☆12Oct 23, 2023Updated 2 years ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing☆11Apr 1, 2020Updated 5 years ago
- Docker for Your ML/DL Models Based on OCI Artifacts☆474Jan 26, 2024Updated 2 years ago
- An awesome & curated list of best LLMOps tools for developers☆5,645Feb 3, 2026Updated last month
- OpenAI compatible API for open source LLMs☆16Oct 30, 2023Updated 2 years ago
- Generic prefix tree for golang☆13Apr 25, 2025Updated 10 months ago
- EpochFS is a versioned cloud file system with git-like branching, transaction support.☆17Feb 3, 2026Updated last month
- ☆145Dec 6, 2023Updated 2 years ago
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆26Apr 24, 2025Updated 10 months ago
- EvalGPT is an code interpreter framework that utilizes large language models to automate the process of code-writing and execution, deliv…☆248Sep 17, 2023Updated 2 years ago
- An experimental tool to modify YAMLs without losing (most of) comment lines.☆16Sep 25, 2022Updated 3 years ago
- Model Deployment at Scale on Kubernetes 🦄️☆836May 8, 2024Updated last year
- PostgreSQL tokenizer extension for full-text search☆37Sep 29, 2025Updated 5 months ago
- Automated, schema-based JSON unpacking to Polars objects☆13Sep 14, 2025Updated 5 months ago
- An AI framework for building cool things.☆211Jun 5, 2023Updated 2 years ago
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, o…☆9,516Updated this week
- Scalable, Low-latency and Hybrid-enabled Vector Search in Postgres. Revolutionize Vector Search, not Database.☆2,158Feb 26, 2025Updated last year
- Apache OpenDAL Go Binding Services Releases☆14Sep 11, 2025Updated 5 months ago
- a fast cross platform AI inference engine 🤖 using Rust 🦀 and WebGPU 🎮☆463Jan 4, 2025Updated last year
- Your AI Kubernetes Expert☆186Apr 6, 2023Updated 2 years ago
- The inference code of RVC-Boss/GPT-SoVITS that can be developer-friendly.☆16Sep 29, 2024Updated last year
- 中国开发者活动日程(关注点:开源、开发者、云原生)☆23Updated this week
- AI-based search done right☆20Dec 25, 2025Updated 2 months ago
- RayLLM - LLMs on Ray (Archived). Read README for more info.☆1,267Mar 13, 2025Updated 11 months ago
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!☆289Jan 26, 2026Updated last month
- Benchmark results from code generation with LLMs☆17Sep 1, 2023Updated 2 years ago
- ☆16May 4, 2021Updated 4 years ago
- The DGL Operator makes it easy to run Deep Graph Library (DGL) graph neural network training on Kubernetes☆44Sep 15, 2021Updated 4 years ago
- Semantic cache for LLMs. Fully integrated with LangChain and llama_index.☆7,951Jul 11, 2025Updated 7 months ago
- A Survey of AI startups☆403Aug 27, 2023Updated 2 years ago