OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)
☆277 · Oct 11, 2023 · Updated 2 years ago
Alternatives and similar repositories for modelz-llm
Users that are interested in modelz-llm are comparing it to the libraries listed below.
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others) · ☆283 · Nov 3, 2023 · Updated 2 years ago
- A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine · ☆902 · Jun 25, 2026 · Updated last week
- DataFusion Playground with WASM · ☆14 · Apr 8, 2024 · Updated 2 years ago
- OpenDAL fsspec integration · ☆37 · Jan 20, 2026 · Updated 5 months ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing · ☆12 · Apr 1, 2020 · Updated 6 years ago
- The Databend plugin for dbt (data build tool) · ☆12 · Mar 17, 2023 · Updated 3 years ago
- OSPP 2022 Project: String Adaptive Hash Table for Databend · ☆19 · Sep 15, 2022 · Updated 3 years ago
- NVIDIA device plugin for Kubernetes · ☆15 · Sep 9, 2019 · Updated 6 years ago
- Jittor code for APDrawingGAN: Generating Artistic Portrait Drawings from Face Photos with Hierarchical GANs (CVPR 2019 Oral) · ☆13 · Apr 13, 2021 · Updated 5 years ago
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud. · ☆12,381 · Updated this week
- A CoreDNS plugin to create records for Kubernetes nodes. · ☆13 · Apr 4, 2023 · Updated 3 years ago
- Apache OpenDAL Go Binding Services Releases · ☆16 · Jun 1, 2026 · Updated last month
- Openai style api for open large language models, using LLMs just as chatgpt! Support for LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, X… · ☆2,457 · Sep 26, 2024 · Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions · ☆27 · Sep 26, 2023 · Updated 2 years ago
- Extensible backend of software mirror · ☆30 · Mar 23, 2025 · Updated last year
- An openAI CLI built in rust · ☆10 · Dec 28, 2022 · Updated 3 years ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. · ☆39,492 · May 1, 2026 · Updated 2 months ago
- 🦙🦙.🦀 · ☆28 · Sep 24, 2023 · Updated 2 years ago
- An awesome & curated list of best LLMOps tools for developers · ☆5,866 · May 21, 2026 · Updated last month
- 🏕️ Reproducible development environment for humans and agents · ☆2,211 · May 21, 2026 · Updated last month
- Run any Large Language Model behind a unified API · ☆169 · Nov 13, 2023 · Updated 2 years ago
- An easy way to host your own AI API and expose alternative models, while being compatible with "open" AI clients. · ☆328 · Jul 16, 2024 · Updated last year
- Scalable, Low-latency and Hybrid-enabled Vector Search in Postgres. Revolutionize Vector Search, not Database. · ☆2,175 · Feb 26, 2025 · Updated last year
- Learn Data Lake From Storage Layer. · ☆44 · Aug 4, 2024 · Updated last year
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI… · ☆27 · Dec 6, 2024 · Updated last year
- Workflow Defined Engine · ☆25 · Nov 4, 2025 · Updated 8 months ago
- multi-master-paxos with 3 nodes · ☆14 · Apr 11, 2022 · Updated 4 years ago
- A slab allocator with stable references · ☆15 · Jan 23, 2023 · Updated 3 years ago
- ☆12 · Jun 8, 2026 · Updated 3 weeks ago
- a simple programming language under development · ☆11 · Dec 3, 2023 · Updated 2 years ago
- Quick & Dirty cli to process mysql dumps · ☆10 · Sep 30, 2022 · Updated 3 years ago
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl… · ☆14 · Feb 7, 2025 · Updated last year
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations. · ☆12 · Feb 11, 2024 · Updated 2 years ago
- JSONB implement in rust · ☆87 · Jun 15, 2026 · Updated 2 weeks ago
- EpochFS is a versioned cloud file system with git-like branching, transaction support. · ☆17 · Apr 23, 2026 · Updated 2 months ago
- OpenAI compatible API for open source LLMs · ☆17 · Oct 30, 2023 · Updated 2 years ago
- Rust bindings for Kubernetes Container Storage Interface generated from Protobuf using Tonic/Prost · ☆15 · Aug 4, 2021 · Updated 4 years ago
- ☆15 · Jul 18, 2023 · Updated 2 years ago
- Rust based high-performance Apache Uniffle shuffle-server · ☆68 · Jun 26, 2026 · Updated last week