Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)
☆282 · Nov 3, 2023 · Updated 2 years ago
Alternatives and similar repositories for openmodelz
Users that are interested in openmodelz are comparing it to the libraries listed below.
- 🏕️ Reproducible development environment for humans and agents ☆2,200 · Updated this week
- OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others) ☆277 · Oct 11, 2023 · Updated 2 years ago
- A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine ☆899 · Updated this week
- This repository contains statistics about the AI Infrastructure products. ☆17 · Feb 27, 2025 · Updated last year
- With Dejavu, you can have a perfect memory by capturing and organizing your visual recordings efficiently. ☆132 · Sep 1, 2023 · Updated 2 years ago
- This is a landscape of the infrastructure that powers the generative AI ecosystem ☆156 · Oct 16, 2024 · Updated last year
- OpenDAL fsspec integration ☆35 · Jan 20, 2026 · Updated 3 months ago
- my bachelor's thesis in SJTU about https://github.com/caicloud/cyclone ☆12 · Jan 4, 2018 · Updated 8 years ago
- ☆19 · Apr 11, 2024 · Updated 2 years ago
- Kexplain is an interactive kubectl explain ☆12 · Oct 23, 2023 · Updated 2 years ago
- Docker for Your ML/DL Models Based on OCI Artifacts ☆474 · Jan 26, 2024 · Updated 2 years ago
- An awesome & curated list of best LLMOps tools for developers ☆5,764 · Apr 6, 2026 · Updated 3 weeks ago
- Personal Blog in github.io ☆10 · Feb 25, 2026 · Updated 2 months ago
- Turn PostgreSQL into your search engine in a Pythonic way. ☆52 · Aug 29, 2025 · Updated 8 months ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing ☆12 · Apr 1, 2020 · Updated 6 years ago
- ☆145 · Dec 6, 2023 · Updated 2 years ago
- IBM Quantum Challenge Fall 2023 ☆10 · May 23, 2023 · Updated 2 years ago
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises ☆26 · Apr 24, 2025 · Updated last year
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ cl… ☆9,923 · Updated this week
- An AI framework for building cool things. ☆211 · Jun 5, 2023 · Updated 2 years ago
- OpenAI compatible API for open source LLMs ☆17 · Oct 30, 2023 · Updated 2 years ago
- RayLLM - LLMs on Ray (Archived). Read README for more info. ☆1,267 · Mar 13, 2025 · Updated last year
- EpochFS is a versioned cloud file system with git-like branching, transaction support. ☆17 · Apr 23, 2026 · Updated last week
- Create informative READMEs effortlessly using AI-driven templates with the README Creator powered by Language Model (LLM). Simplify docum… ☆13 · Aug 11, 2023 · Updated 2 years ago
- Scalable, Low-latency and Hybrid-enabled Vector Search in Postgres. Revolutionize Vector Search, not Database. ☆2,172 · Feb 26, 2025 · Updated last year
- Benchmark results from code generation with LLMs ☆17 · Sep 1, 2023 · Updated 2 years ago
- Run your deep learning workloads on Kubernetes more easily and efficiently. ☆531 · Mar 4, 2024 · Updated 2 years ago
- Your AI Kubernetes Expert ☆186 · Apr 6, 2023 · Updated 3 years ago
- a fast cross platform AI inference engine 🤖 using Rust 🦀 and WebGPU 🎮 ☆467 · Jan 4, 2025 · Updated last year
- An experimental tool to modify YAMLs without losing (most of) comment lines. ☆16 · Sep 25, 2022 · Updated 3 years ago
- A Survey of AI startups ☆402 · Aug 27, 2023 · Updated 2 years ago
- AI-based search done right ☆20 · Dec 25, 2025 · Updated 4 months ago
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work! ☆302 · Jan 26, 2026 · Updated 3 months ago
- A powerful prompt template engine built upon Jinja ☆12 · Oct 22, 2025 · Updated 6 months ago
- Model Deployment at Scale on Kubernetes 🦄️ ☆839 · May 8, 2024 · Updated last year
- Semantic cache for LLMs. Fully integrated with LangChain and llama_index. ☆8,007 · Jul 11, 2025 · Updated 9 months ago
- PostgreSQL tokenizer extension for full-text search ☆42 · Sep 29, 2025 · Updated 7 months ago
- A diverse, simple, and secure all-in-one LLMOps platform ☆112 · Sep 21, 2024 · Updated last year
- A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments. ☆31 · Mar 28, 2025 · Updated last year