helm charts for deploying models with llm-d
☆31Apr 22, 2026Updated last month
Alternatives and similar repositories for llm-d-modelservice
Users that are interested in llm-d-modelservice are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Simplified model deployment on llm-d☆29Jul 2, 2025Updated 10 months ago
- Hall C++ Analyzer☆10Apr 8, 2026Updated last month
- llm-d benchmark scripts and tooling☆60May 22, 2026Updated last week
- An example of how to use Avalon interrupts on the Cyclone V FPGA☆15May 25, 2014Updated 12 years ago
- llm-d helm charts and deployment examples☆57May 1, 2026Updated 3 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- FreeRTOS with LwIP integration in the Nios II EDS☆19Jan 30, 2016Updated 10 years ago
- Definition, proposals, and conformance tests for AI Conformance☆45Updated this week
- A Golang library for analyzing k8s connectivity-configuration resources (a.k.a. network policies)☆19Feb 1, 2026Updated 3 months ago
- ☆18May 6, 2026Updated 3 weeks ago
- Auto-tuning for vllm. Getting the best performance out of your LLM deployment (vllm+guidellm+optuna)☆52Mar 17, 2026Updated 2 months ago
- A stateful serverless demo app running on AWS Lambda, using Apache Flink Stateful Functions☆15Oct 13, 2020Updated 5 years ago
- A Go gRPC client library for Vald☆13Apr 15, 2026Updated last month
- A shell script for creating a new emqx node for an existing one☆12Sep 14, 2022Updated 3 years ago
- ☆12Oct 1, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- PODIO☆34May 20, 2026Updated last week
- Notebooks for Scaling Deep Learning Interpretability by Visualizing Activation and Attribution Summarizations☆15Oct 3, 2019Updated 6 years ago
- 低端存储知识☆13Mar 8, 2019Updated 7 years ago
- Japanese synonym library☆11Apr 18, 2022Updated 4 years ago
- Generate boilerplates for layered architecture by your templates.☆13Dec 27, 2019Updated 6 years ago
- llb2dot package lets you to convert BuildKit LLB to dot language to analize. You can also directly load Dockerfile☆10Oct 2, 2019Updated 6 years ago
- CUPTI based GPU profiling library exposing usdt hooks☆31May 20, 2026Updated last week
- Community maintained hardware plugin for vLLM on Spyre☆52May 21, 2026Updated last week
- DeepTrace: A lightweight, scalable real-time diagnostic and analysis tool for distributed training tasks.☆18Nov 4, 2025Updated 6 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Docker packaging for Apache Flink Stateful Functions☆18May 15, 2026Updated 2 weeks ago
- compare WebAssembly build size depends on imported package.☆12Dec 11, 2018Updated 7 years ago
- AllenNLP integration for Shiba: Japanese CANINE model☆12Jun 26, 2021Updated 4 years ago
- generates sakatsu badge from SAUNA-IKITAI.☆11Feb 21, 2021Updated 5 years ago
- A benchmarking tool to evaluate Knative performance☆39Sep 15, 2023Updated 2 years ago
- Alibaba Cloud's high-performance KVCache system for LLM inference, with components for global cache management, inference simulation(HiSi…☆173Updated this week
- ☆15Apr 14, 2023Updated 3 years ago
- Build URL of GCP Cloud Logging Logs Explorer☆16Jul 4, 2024Updated last year
- Awesome List of Sources of Japanese Censored Words☆19Sep 11, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆11Aug 12, 2020Updated 5 years ago
- vLLM Router☆55Mar 11, 2024Updated 2 years ago
- Tensorflow 2.0 implementation of STAR RNN☆10Jun 7, 2020Updated 5 years ago
- Lightweight threads for Java, with message passing, nio, http and scheduling support.☆17Oct 10, 2014Updated 11 years ago
- 🎲 A Kotlin DSL for probabilistic programming.☆12Apr 8, 2022Updated 4 years ago
- ☆24Oct 9, 2025Updated 7 months ago
- bqiam is an admin tool for managing BigQuery permissions☆12Apr 24, 2026Updated last month