☆20Mar 11, 2026Updated 2 months ago
Alternatives and similar repositories for inference-benchmark
Users that are interested in inference-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Feb 18, 2025Updated last year
- ☆36Updated this week
- Gateway API Inference Extension☆679May 28, 2026Updated last week
- Package chanstream implements an API compatible with and similiar to the TCP connection (and net.Conn as well) API, on top of Go channels…☆14Sep 2, 2020Updated 5 years ago
- Collection of LLM completions for reasoning-gym task datasets☆31Jul 4, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆13Oct 27, 2023Updated 2 years ago
- Do Framework Definition☆17Sep 13, 2024Updated last year
- A starting point for creating service brokers implementing the Open Service Broker API☆30Aug 11, 2017Updated 8 years ago
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆27Apr 24, 2025Updated last year
- ☆18Feb 17, 2020Updated 6 years ago
- Test Orchestrator for Performance and Scalability of AI pLatforms☆18May 26, 2026Updated last week
- Compatibility layer to run Cloud Foundry applications on OpenShift☆11Sep 2, 2016Updated 9 years ago
- llm-d Router: The intelligent entry point for inference requests☆214Updated this week
- An earning call robot built with LLM☆10Aug 4, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A collection of useful Go libraries to ease the development of NVIDIA Operators for GPU/NIC management.☆30May 18, 2026Updated 3 weeks ago
- ☆25Apr 1, 2026Updated 2 months ago
- Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation [ICML 2021]☆14Jul 17, 2025Updated 10 months ago
- ☆25Jun 24, 2025Updated 11 months ago
- [Deprecated] Vulnerability scanner for containers and images☆13Oct 26, 2015Updated 10 years ago
- caniuse.com, but for kubernetes☆27Dec 25, 2024Updated last year
- Mixtral-based Ja-En (En-Ja) Translation model☆20Jan 6, 2025Updated last year
- ☆24Nov 26, 2024Updated last year
- Synchronizes OpenShift BuildConfig objects as Jenkins jobs and synchronizes job status into OpenShift Build objects☆17Mar 24, 2026Updated 2 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- TPU inference for vLLM, with unified JAX and PyTorch support.☆348Updated this week
- Machine Learning from Human Preferences☆34Mar 23, 2026Updated 2 months ago
- Repo containing documentation and explanation for CSET's harm taxonomy of incidents from AIID.☆20Jun 21, 2024Updated last year
- ☆32Nov 4, 2024Updated last year
- The original Shared Recurrent Memory Transformer implementation☆36Jul 11, 2025Updated 10 months ago
- This is a repository for code, data, and models associated with the paper LLM-RUBRIC: A Multidimensional, Calibrated Approach to Automate…☆32Mar 30, 2026Updated 2 months ago
- ☆17Apr 16, 2021Updated 5 years ago
- A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments.☆31Mar 28, 2025Updated last year
- WG Serving☆37Mar 24, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆11Apr 29, 2026Updated last month
- RDMA CNI plugin for containerized workloads☆60Updated this week
- Fleetboard establishes an independent and unified parallel network, facilitating cross-cluster service discovery even in cases of IP over…☆31May 13, 2025Updated last year
- Explore graphs in a visual way☆17May 19, 2026Updated 2 weeks ago
- WordPress plugin that provides a Gutenberg block for embedding the SemiAnalysis die yield calculator in posts and pages☆21Updated this week
- Models for data stocks and training dataset sizes☆19Jul 10, 2024Updated last year
- acmeair-netflixoss-dockerlocal☆33Aug 6, 2014Updated 11 years ago