OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)
☆277Oct 11, 2023Updated 2 years ago
Alternatives and similar repositories for modelz-llm
Users that are interested in modelz-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)☆281Nov 3, 2023Updated 2 years ago
- A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine☆893Mar 1, 2026Updated 3 weeks ago
- Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)☆28Jun 28, 2023Updated 2 years ago
- DataFusion Playground with WASM☆14Apr 8, 2024Updated last year
- OpenDAL fsspec integration☆34Jan 20, 2026Updated 2 months ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing☆11Apr 1, 2020Updated 5 years ago
- A bridge between different serde implementations.☆16Sep 8, 2025Updated 6 months ago
- The Databend plugin for dbt (data build tool)☆12Mar 17, 2023Updated 3 years ago
- Kubectl plugin for crane, including recommendation and cost estimate.☆15Apr 24, 2023Updated 2 years ago
- NVIDIA device plugin for Kubernetes☆15Sep 9, 2019Updated 6 years ago
- Jittor code for APDrawingGAN: Generating Artistic Portrait Drawings from Face Photos with Hierarchical GANs (CVPR 2019 Oral)☆13Apr 13, 2021Updated 4 years ago
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.☆12,174Mar 16, 2026Updated last week
- A CoreDNS plugin to create records for Kubernetes nodes.☆13Apr 4, 2023Updated 2 years ago
- Apache OpenDAL Go Binding Services Releases☆15Sep 11, 2025Updated 6 months ago
- Document Q&A on Wikipedia articles using LLMs☆81Sep 15, 2023Updated 2 years ago
- Openai style api for open large language models, using LLMs just as chatgpt! Support for LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, X…☆2,467Sep 26, 2024Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Sep 26, 2023Updated 2 years ago
- Extensible backend of software mirror☆29Mar 23, 2025Updated last year
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆39,428Jun 2, 2025Updated 9 months ago
- An awesome & curated list of best LLMOps tools for developers☆5,668Feb 3, 2026Updated last month
- 🏕️ Reproducible development environment for humans and agents☆2,187Mar 5, 2026Updated 2 weeks ago
- Scalable, Low-latency and Hybrid-enabled Vector Search in Postgres. Revolutionize Vector Search, not Database.☆2,162Feb 26, 2025Updated last year
- API definitions for the crane project☆19Jul 23, 2025Updated 8 months ago
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI…☆26Dec 6, 2024Updated last year
- Workflow Defined Engine☆25Nov 4, 2025Updated 4 months ago
- multi-master-paxos with 3 nodes☆14Apr 11, 2022Updated 3 years ago
- A slab allocator with stable references☆15Jan 23, 2023Updated 3 years ago
- ☆12Aug 14, 2025Updated 7 months ago
- Batch-scheduler based on K8s scheduling framework, related features have contributed to scheduler-plugins(Deprecated).☆25Aug 6, 2020Updated 5 years ago
- a simple programming language under development☆11Dec 3, 2023Updated 2 years ago
- Quick & Dirty cli to process mysql dumps☆10Sep 30, 2022Updated 3 years ago
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl…☆14Feb 7, 2025Updated last year
- JSONB implement in rust☆86Jan 27, 2026Updated last month
- OpenAI compatible API for open source LLMs☆16Oct 30, 2023Updated 2 years ago
- EpochFS is a versioned cloud file system with git-like branching, transaction support.☆17Mar 11, 2026Updated last week
- Rust bindings for Kubernetes Container Storage Interface generated from Protobuf using Tonic/Prost☆14Aug 4, 2021Updated 4 years ago
- An awesome & curated list of best LLMOps tools for developers☆24Jun 21, 2023Updated 2 years ago
- ☆15Jul 18, 2023Updated 2 years ago
- Bio☆13Updated this week