tensorchord / openmodelz
Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)
β253Updated last year
Alternatives and similar repositories for openmodelz:
Users that are interested in openmodelz are comparing it to the libraries listed below
- OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)β272Updated last year
- This is a landscape of the infrastructure that powers the generative AI ecosystemβ137Updated 4 months ago
- π An awesome & curated list of best LLMOps tools.β41Updated 2 weeks ago
- a local implementation of OpenAI Assistants API: myla stands for MY Local Assistantβ53Updated 6 months ago
- EvalGPT is an code interpreter framework that utilizes large language models to automate the process of code-writing and execution, delivβ¦β250Updated last year
- A diverse, simple, and secure all-in-one LLMOps platformβ100Updated 5 months ago
- Self-host LLMs with vLLM and BentoMLβ90Updated this week
- Octogen is an Open-Source Code Interpreter Agent Frameworkβ254Updated 6 months ago
- Lepton Examplesβ141Updated 2 months ago
- ChatData π π brings RAG to real applications with FREEβ¨ knowledge bases. Now enjoy your chat with 6 million wikipedia pages and 2 milliβ¦β167Updated 3 months ago
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.β63Updated 10 months ago
- An open-source cloud-native of large multi-modal models (LMMs) serving framework.β161Updated last year
- Examples on how to use LangChain and Rayβ226Updated last year
- an MLOps/LLMOps platformβ225Updated 2 months ago
- Using LlamaIndex with Ray for productionizing LLM applicationsβ71Updated last year
- Cloud-native way to provide elastic Jupyter Notebooks on Kubernetes. Run remote kernels, natively.β195Updated 2 years ago
- GPUd automates monitoring, diagnostics, and issue identification for GPUsβ281Updated this week
- β448Updated last year
- β53Updated 2 months ago
- Your AI Kubernetes Expertβ176Updated last year
- LLMPerf is a library for validating and benchmarking LLMsβ788Updated 2 months ago
- Turn any OCR models into online inference API endpoint π πβ53Updated 2 weeks ago
- A lightweight version of Milvusβ303Updated this week
- RayLLM - LLMs on Rayβ1,260Updated 9 months ago
- Open-source observability for your LLM application.β49Updated 2 months ago
- Model Deployment at Scale on Kubernetes π¦οΈβ795Updated 9 months ago
- Benchmarking suite for popular AI APIsβ81Updated 3 weeks ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.β58Updated last month
- Finetune LLMs on K8s by using Runbooksβ170Updated 6 months ago
- β164Updated this week