OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)
☆276Oct 11, 2023Updated 2 years ago
Alternatives and similar repositories for modelz-llm
Users that are interested in modelz-llm are comparing it to the libraries listed below
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)☆281Nov 3, 2023Updated 2 years ago
- A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine☆892Updated this week
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing☆11Apr 1, 2020Updated 5 years ago
- OpenDAL fsspec integration☆34Jan 20, 2026Updated last month
- DataFusion Playground with WASM☆14Apr 8, 2024Updated last year
- ☆15Jan 25, 2024Updated 2 years ago
- The Databend plugin for dbt (data build tool)☆12Mar 17, 2023Updated 2 years ago
- A bridge between different serde implementations.☆16Sep 8, 2025Updated 5 months ago
- OSPP 2022 Project: String Adaptive Hash Table for Databend☆19Sep 15, 2022Updated 3 years ago
- NVIDIA device plugin for Kubernetes☆15Sep 9, 2019Updated 6 years ago
- This repository `II-Commons` contains tools for managing text and image datasets, including loading, fetching, and embedding large datase…☆33Jul 22, 2025Updated 7 months ago
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.☆12,140Feb 23, 2026Updated last week
- QDrant docker-compose deployment with basic auth/nginx proxy☆23Apr 12, 2023Updated 2 years ago
- A collection of actions for working with ROS data☆14Jun 11, 2025Updated 8 months ago
- Batch scheduler based on the K8s scheduling framework; related features have been contributed to scheduler-plugins (Deprecated).☆25Aug 6, 2020Updated 5 years ago
- Rust based high-performance Apache Uniffle shuffle-server☆62Updated this week
- Learn Data Lake From Storage Layer.☆44Aug 4, 2024Updated last year
- 🦙🦙.🦀☆28Sep 24, 2023Updated 2 years ago
- Simulated large clusters for Kubernetes scheduler validation.☆15Jan 3, 2023Updated 3 years ago
- POM: Occupancy map estimation for people detection☆10Aug 5, 2014Updated 11 years ago
- Operating System☆10Jun 14, 2025Updated 8 months ago
- UnitEval is a benchmarking and evaluation tool for AutoDev Coder.☆13Jan 2, 2024Updated 2 years ago
- Explore the possibilities of the Yazi plugin system, and provide some experimental feature enhancements.☆12Feb 1, 2024Updated 2 years ago
- An example starter repo using NextJS + AWS Lambda/APG to build a web app with the OpenAI API☆13Sep 5, 2023Updated 2 years ago
- a simple programming language under development☆11Dec 3, 2023Updated 2 years ago
- ☆10Jul 29, 2020Updated 5 years ago
- ☆12Aug 14, 2025Updated 6 months ago
- Scalable, Low-latency and Hybrid-enabled Vector Search in Postgres. Revolutionize Vector Search, not Database.☆2,158Feb 26, 2025Updated last year
- An awesome & curated list of best LLMOps tools for developers☆5,645Feb 3, 2026Updated last month
- JSONB implementation in Rust☆86Jan 27, 2026Updated last month
- Static analysis framework for analyzing programs written in TVM's Relay IR.☆29Oct 31, 2019Updated 6 years ago
- Workflow Defined Engine☆25Nov 4, 2025Updated 4 months ago
- 🏕️ Reproducible development environment for humans and agents☆2,184Updated this week
- Experimental repository for GSoC 2024.☆15Aug 29, 2024Updated last year
- A fast & easy way to train ML models in your cloud, directly from your laptop.☆14Mar 28, 2022Updated 3 years ago
- Evaluation code of ASE24 accepted paper "On the Evaluation of LLM in Unit Test Generation"☆13Dec 9, 2024Updated last year
- OpenAI compatible API for open source LLMs☆16Oct 30, 2023Updated 2 years ago
- multi-master-paxos with 3 nodes☆13Apr 11, 2022Updated 3 years ago
- Studying GPU Multi-tenancy☆11Jan 11, 2019Updated 7 years ago