tensorchord / openmodelzLinks

Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)

☆270

Alternatives and similar repositories for openmodelz

Users that are interested in openmodelz are comparing it to the libraries listed below

Sorting:

tensorchord / modelz-llm
OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)
☆275Updated last year
tensorchord / ai-infra-landscape
This is a landscape of the infrastructure that powers the generative AI ecosystem
☆148Updated 9 months ago
kubeagi / arcadia
A diverse, simple, and secure all-in-one LLMOps platform
☆107Updated 10 months ago
index-labs / evalgpt
EvalGPT is an code interpreter framework that utilizes large language models to automate the process of code-writing and execution, deliv…
☆249Updated last year
bentoml / BentoOCR
Turn any OCR models into online inference API endpoint 🚀 🌖
☆57Updated 4 months ago
myscale / myscale-telemetry
Open-source observability for your LLM application.
☆53Updated 7 months ago
jina-ai / rungpt
An open-source cloud-native of large multi-modal models (LMMs) serving framework.
☆167Updated last year
muyuworks / myla
a local implementation of OpenAI Assistants API: myla stands for MY Local Assistant
☆55Updated 11 months ago
asprenger / ray_vllm_inference
A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.
☆69Updated last year
star-whale / starwhale
an MLOps/LLMOps platform
☆230Updated 7 months ago
intel / llm-on-ray
Pretrain, finetune and serve LLMs on Intel platforms with Ray
☆128Updated 3 weeks ago
bentoml / BentoVLLM
Self-host LLMs with vLLM and BentoML
☆139Updated this week
ray-project / langchain-ray
Examples on how to use LangChain and Ray
☆229Updated 2 years ago
tensorchord / ai-infra-statistics
This repository contains statistics about the AI Infrastructure products.
☆18Updated 5 months ago
vtuber-plan / olah
Self-hosted huggingface mirror service. 自建huggingface镜像服务。
☆184Updated 2 weeks ago
leptonai / examples
Lepton Examples
☆141Updated 2 weeks ago
01-ai / Descartes
☆111Updated last year
leptonai / gpud
GPUd automates monitoring, diagnostics, and issue identification for GPUs
☆405Updated this week
milvus-io / milvus-lite
A lightweight version of Milvus
☆355Updated this week
substratusai / sandboxai
Run AI generated code in isolated sandboxes
☆90Updated 5 months ago
amogkam / llama_index_ray
Using LlamaIndex with Ray for productionizing LLM applications
☆71Updated 2 years ago
tensorchord / vechord
Turn PostgreSQL into your search engine in a Pythonic way.
☆47Updated last week
huggingface / xet-core
xet client tech, used in huggingface_hub
☆148Updated this week
substratusai / vllm-docker
☆63Updated 4 months ago
skai-x / elastic-jupyter-operator
Cloud-native way to provide elastic Jupyter Notebooks on Kubernetes. Run remote kernels, natively.
☆201Updated 3 years ago
fixie-ai / ai-benchmarks
Benchmarking suite for popular AI APIs
☆87Updated 5 months ago
limcheekin / open-text-embeddings
Open Source Text Embedding Models with OpenAI Compatible API
☆156Updated last year
InftyAI / Awesome-LLMOps
🎉 An awesome & curated list of best LLMOps tools.
☆141Updated last week
zozoheir / tinyllm
Develop, evaluate and monitor LLM applications at scale
☆100Updated 8 months ago
myscale / ChatData
ChatData 🔍 📖 brings RAG to real applications with FREE✨ knowledge bases. Now enjoy your chat with 6 million wikipedia pages and 2 milli…
☆176Updated 8 months ago