OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)
☆277 · Oct 11, 2023 · Updated 2 years ago
Alternatives and similar repositories for modelz-llm
Users that are interested in modelz-llm are comparing it to the libraries listed below.
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others) ☆282 · Nov 3, 2023 · Updated 2 years ago
- A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine ☆899 · Updated this week
- DataFusion Playground with WASM ☆14 · Apr 8, 2024 · Updated 2 years ago
- OpenDAL fsspec integration ☆35 · Jan 20, 2026 · Updated 3 months ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing ☆12 · Apr 1, 2020 · Updated 6 years ago
- A bridge between different serde implementations. ☆16 · Sep 8, 2025 · Updated 7 months ago
- The Databend plugin for dbt (data build tool) ☆12 · Mar 17, 2023 · Updated 3 years ago
- Kubectl plugin for crane, including recommendation and cost estimate. ☆15 · Apr 24, 2023 · Updated 3 years ago
- OSPP 2022 Project: String Adaptive Hash Table for Databend ☆19 · Sep 15, 2022 · Updated 3 years ago
- NVIDIA device plugin for Kubernetes ☆15 · Sep 9, 2019 · Updated 6 years ago
- Jittor code for APDrawingGAN: Generating Artistic Portrait Drawings from Face Photos with Hierarchical GANs (CVPR 2019 Oral) ☆13 · Apr 13, 2021 · Updated 5 years ago
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud. ☆12,303 · Apr 27, 2026 · Updated last week
- A CoreDNS plugin to create records for Kubernetes nodes. ☆13 · Apr 4, 2023 · Updated 3 years ago
- Apache OpenDAL Go Binding Services Releases ☆15 · Sep 11, 2025 · Updated 7 months ago
- Openai style api for open large language models, using LLMs just as chatgpt! Support for LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, X… ☆2,467 · Sep 26, 2024 · Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions ☆27 · Sep 26, 2023 · Updated 2 years ago
- Extensible backend of software mirror ☆30 · Mar 23, 2025 · Updated last year
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. ☆39,463 · Updated this week
- An awesome & curated list of best LLMOps tools for developers ☆5,764 · Apr 6, 2026 · Updated 3 weeks ago
- 🏕️ Reproducible development environment for humans and agents ☆2,200 · Updated this week
- Run any Large Language Model behind a unified API ☆170 · Nov 13, 2023 · Updated 2 years ago
- An easy way to host your own AI API and expose alternative models, while being compatible with "open" AI clients. ☆329 · Jul 16, 2024 · Updated last year
- Scalable, Low-latency and Hybrid-enabled Vector Search in Postgres. Revolutionize Vector Search, not Database. ☆2,172 · Feb 26, 2025 · Updated last year
- Learn Data Lake From Storage Layer. ☆44 · Aug 4, 2024 · Updated last year
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI… ☆26 · Dec 6, 2024 · Updated last year
- Workflow Defined Engine ☆25 · Nov 4, 2025 · Updated 6 months ago
- multi-master-paxos with 3 nodes ☆14 · Apr 11, 2022 · Updated 4 years ago
- A slab allocator with stable references ☆15 · Jan 23, 2023 · Updated 3 years ago
- ☆12 · Updated this week
- Batch-scheduler based on K8s scheduling framework, related features have contributed to scheduler-plugins(Deprecated). ☆26 · Aug 6, 2020 · Updated 5 years ago
- a simple programming language under development ☆11 · Dec 3, 2023 · Updated 2 years ago
- Quick & Dirty cli to process mysql dumps ☆10 · Sep 30, 2022 · Updated 3 years ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations. ☆12 · Feb 11, 2024 · Updated 2 years ago
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl… ☆14 · Feb 7, 2025 · Updated last year
- JSONB implement in rust ☆86 · Mar 27, 2026 · Updated last month
- OpenAI compatible API for open source LLMs ☆17 · Oct 30, 2023 · Updated 2 years ago
- EpochFS is a versioned cloud file system with git-like branching, transaction support. ☆17 · Apr 23, 2026 · Updated last week
- Rust bindings for Kubernetes Container Storage Interface generated from Protobuf using Tonic/Prost ☆14 · Aug 4, 2021 · Updated 4 years ago
- ☆15 · Jul 18, 2023 · Updated 2 years ago