InftyAI / Awesome-LLMOps
π An awesome & curated list of best LLMOps tools.
β40Updated last week
Alternatives and similar repositories for Awesome-LLMOps:
Users that are interested in Awesome-LLMOps are comparing it to the libraries listed below
- βΈοΈ Easy, advanced inference platform for large language models on Kubernetes. π Star to support our work!β69Updated this week
- A diverse, simple, and secure all-in-one LLMOps platformβ98Updated 5 months ago
- OpenAI compatible API for open source LLMsβ15Updated last year
- Knowledge for GPTScriptβ29Updated 3 months ago
- Self-host LLMs with vLLM and BentoMLβ87Updated this week
- π« A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSIβ¦β20Updated 2 months ago
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.β62Updated 10 months ago
- π‘ Deploy AI models and apps to Kubernetes without developing a herniaβ32Updated 8 months ago
- π§― Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.β26Updated last month
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.β49Updated this week
- API Extensions for core KubeVela.β11Updated this week
- This is a landscape of the infrastructure that powers the generative AI ecosystemβ136Updated 4 months ago
- Repository hosting Langchain helm charts.β44Updated this week
- β53Updated last month
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.β62Updated 3 weeks ago
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)β251Updated last year
- Document parser for RAGβ20Updated 3 months ago
- π’ Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! π«β139Updated this week
- A QA system based on k8s-specific knowledge build on ChatGLM2-6B, serving by Ray.β9Updated last year
- Open-source observability for your LLM application.β48Updated last month
- β45Updated last year
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, iβ¦β12Updated last year
- Using LlamaIndex with Ray for productionizing LLM applicationsβ71Updated last year
- GenAI inference performance benchmarking toolβ17Updated this week
- β23Updated last week
- Nexusflow function call, tool use, and agent benchmarks.β19Updated 2 months ago
- Experiments with open source LLMsβ72Updated this week
- Automated pull requests reviewing and issues triaging with ChatGPT.β70Updated last year