tensorchord/deepseek-api-arena

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tensorchord/deepseek-api-arena)

tensorchord / deepseek-api-arena

A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments.

☆31

Alternatives and similar repositories for deepseek-api-arena

Users that are interested in deepseek-api-arena are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dyweb / sundial
View on GitHub
[WIP] Open Source WakaTime Server
☆14Feb 4, 2019Updated 7 years ago
dyweb / go-jaccount
View on GitHub
Go Client for jAccount
☆12Jul 18, 2025Updated last year
dyweb / SJTUThesis
View on GitHub
DEPRECATED, please use upstream at @sjtug
☆12Dec 26, 2017Updated 8 years ago
bytedance / InfiniStore
View on GitHub
KV cache store for distributed LLM inference
☆425Nov 13, 2025Updated 8 months ago
AkihiroSuda / yamlctl
View on GitHub
An experimental tool to modify YAMLs without losing (most of) comment lines.
☆16Sep 25, 2022Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
rhysd / trygo
View on GitHub
An experimental implementation of 'try' operator for Go
☆13Jun 13, 2019Updated 7 years ago
mpvl / errdare
View on GitHub
☆19May 31, 2018Updated 8 years ago
NEOS-AI / Neosearch
View on GitHub
AI-based search done right
☆20Dec 25, 2025Updated 6 months ago
Infrasys-AI / aiinfra-docs
View on GitHub
☆21Nov 6, 2025Updated 8 months ago
LMCache / LMBenchmark
View on GitHub
Systematic and comprehensive benchmarks for LLM systems.
☆62Jan 28, 2026Updated 5 months ago
cow-on-board / engula-operator
View on GitHub
engula-operator creates/configures/manages engula clusters atop Kubernetes
☆12Jan 5, 2022Updated 4 years ago
gpucloud / k8s-device-plugin
View on GitHub
NVIDIA device plugin for Kubernetes
☆15Sep 9, 2019Updated 6 years ago
gaocegege / bachelor-paper-2016
View on GitHub
my bachelor's thesis in SJTU about https://github.com/caicloud/cyclone
☆12Jan 4, 2018Updated 8 years ago
didiyun / inference-client
View on GitHub
滴滴云推理服务的 HTTP 客户端示例代码
☆21Nov 21, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
HydraQYH / hp_rms_norm
View on GitHub
High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)
☆30Jan 22, 2026Updated 6 months ago
xhHuang94 / hpcl_document_latex_template
View on GitHub
高性能计算实验室文档模板
☆14Aug 11, 2017Updated 8 years ago
sgl-project / DeepGEMM
View on GitHub
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
☆32Updated this week
knoway-dev / knoway
View on GitHub
An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises
☆27Apr 24, 2025Updated last year
Infini-AI-Lab / MagicDec
View on GitHub
[ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding
☆155Dec 4, 2024Updated last year
gaocegege / Blog
View on GitHub
Personal Blog in github.io
☆11Feb 25, 2026Updated 5 months ago
openshift-psap / topsail
View on GitHub
Test Orchestrator for Performance and Scalability of AI pLatforms
☆18Jun 23, 2026Updated last month
KuangjuX / AttnLink
View on GitHub
An experimental communicating attention kernel based on DeepEP.
☆34Jul 29, 2025Updated 11 months ago
dyweb / course
View on GitHub
Dongyue Web Studio course and lecture
☆12Apr 25, 2018Updated 8 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
zhuzilin / flash-attention-with-sink
View on GitHub
☆37Aug 7, 2025Updated 11 months ago
tomtom-international / openlr-python
View on GitHub
OpenLR library for Python
☆15Jul 14, 2025Updated last year
NVIDIA / k8s-operator-libs
View on GitHub
A collection of useful Go libraries to ease the development of NVIDIA Operators for GPU/NIC management.
☆30Updated this week
YibooZhao / cogvideox_vis_attention
View on GitHub
☆10Nov 18, 2024Updated last year
tensorchord / qtext
View on GitHub
☆19Apr 11, 2024Updated 2 years ago
arrowrowe / tam
View on GitHub
Tam is the Assets Manager for you.
☆18Sep 24, 2016Updated 9 years ago
network-automation / httpapi
View on GitHub
example using the httpapi connection plugin
☆11Sep 19, 2018Updated 7 years ago
dyweb / blog
View on GitHub
Dongyue Tech Blog
☆14Jan 5, 2026Updated 6 months ago
feifeibear / ChituAttention
View on GitHub
Quantized Attention on GPU
☆45Nov 22, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
dyweb / mos
View on GitHub
MOOC & Open Source Group
☆19Apr 1, 2018Updated 8 years ago
thib-s / flash-newton-schulz
View on GitHub
My attempt to improve the speed of the newton schulz algorithm, starting from the dion implementation.
☆38Apr 30, 2026Updated 2 months ago
caicloud / golang-template-project
View on GitHub
A template for starting new golang projects at Caicloud
☆55Nov 28, 2022Updated 3 years ago
AI-Hypercomputer / inference-benchmark
View on GitHub
☆22Mar 11, 2026Updated 4 months ago
Toseic / LLM-inference-arxiv-daily
View on GitHub
🎓Automatically Update LLM inference systems Papers Daily using Github Actions (Update Every 12th hours)
☆12Updated this week
sail-sg / tty-use
View on GitHub
☆15Oct 13, 2025Updated 9 months ago
ruvnet / ruv-engineer
View on GitHub
rUv-Engineer - let's you describe UI using your imagination, then see it rendered live.
☆13Sep 28, 2024Updated last year