☆18Mar 11, 2026Updated last week
Alternatives and similar repositories for inference-benchmark
Users who are interested in inference-benchmark are comparing it to the libraries listed below
- ☆33Feb 4, 2026Updated last month
- Gateway API Inference Extension☆609Updated this week
- Package chanstream implements an API compatible with and similar to the TCP connection (and net.Conn as well) API, on top of Go channels…☆14Sep 2, 2020Updated 5 years ago
- Collection of LLM completions for reasoning-gym task datasets☆31Jul 4, 2025Updated 8 months ago
- ☆13Oct 27, 2023Updated 2 years ago
- Do Framework Definition☆17Sep 13, 2024Updated last year
- A starting point for creating service brokers implementing the Open Service Broker API☆30Aug 11, 2017Updated 8 years ago
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆25Apr 24, 2025Updated 10 months ago
- Inference scheduler for llm-d☆140Mar 12, 2026Updated last week
- ☆18Feb 17, 2020Updated 6 years ago
- Test Orchestrator for Performance and Scalability of AI pLatforms☆16Updated this week
- Compatibility layer to run Cloud Foundry applications on OpenShift☆11Sep 2, 2016Updated 9 years ago
- An earning call robot built with LLM☆10Aug 4, 2023Updated 2 years ago
- A collection of useful Go libraries to ease the development of NVIDIA Operators for GPU/NIC management.☆29Mar 6, 2026Updated 2 weeks ago
- Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation [ICML 2021]☆15Jul 17, 2025Updated 8 months ago
- ☆24Jan 27, 2026Updated last month
- TPU inference for vLLM, with unified JAX and PyTorch support.☆262Updated this week
- ☆24Jun 24, 2025Updated 8 months ago
- [Deprecated] Vulnerability scanner for containers and images☆13Oct 26, 2015Updated 10 years ago
- Machine Learning from Human Preferences☆30Feb 13, 2026Updated last month
- caniuse.com, but for kubernetes☆27Dec 25, 2024Updated last year
- Mixtral-based Ja-En (En-Ja) Translation model☆20Jan 6, 2025Updated last year
- ☆20Nov 26, 2024Updated last year
- Synchronizes OpenShift BuildConfig objects as Jenkins jobs and synchronizes job status into OpenShift Build objects☆17Jul 10, 2025Updated 8 months ago
- Repo containing documentation and explanation for CSET's harm taxonomy of incidents from AIID.☆19Jun 21, 2024Updated last year
- The original Shared Recurrent Memory Transformer implementation☆34Jul 11, 2025Updated 8 months ago
- ☆32Nov 4, 2024Updated last year
- This is a repository for code, data, and models associated with the paper LLM-RUBRIC: A Multidimensional, Calibrated Approach to Automate…☆26Feb 18, 2025Updated last year
- ☆17Apr 16, 2021Updated 4 years ago
- ☆10Mar 3, 2026Updated 2 weeks ago
- WG Serving☆34Mar 5, 2026Updated 2 weeks ago
- A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments.☆30Mar 28, 2025Updated 11 months ago
- RDMA CNI plugin for containerized workloads☆60Mar 10, 2026Updated last week
- Fleetboard establishes an independent and unified parallel network, facilitating cross-cluster service discovery even in cases of IP over…☆31May 13, 2025Updated 10 months ago
- Explore graphs in a visual way☆17Mar 8, 2026Updated last week
- WordPress plugin that provides a Gutenberg block for embedding the SemiAnalysis die yield calculator in posts and pages☆19Oct 10, 2025Updated 5 months ago
- ☆37May 15, 2025Updated 10 months ago
- Models for data stocks and training dataset sizes☆18Jul 10, 2024Updated last year
- acmeair-netflixoss-dockerlocal☆33Aug 6, 2014Updated 11 years ago