A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments.
☆31Mar 28, 2025Updated last year
Alternatives and similar repositories for deepseek-api-arena
Users that are interested in deepseek-api-arena are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Go Client for jAccount☆12Jul 18, 2025Updated 10 months ago
- Vortex: A Flexible and Efficient Sparse Attention Framework☆53May 17, 2026Updated last week
- DEPRECATED, please use upstream at @sjtug☆13Dec 26, 2017Updated 8 years ago
- Repository for batch predict☆17Dec 1, 2021Updated 4 years ago
- ☆20Mar 11, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 基于 Touying 的上海交通大学 Typst 幻灯片模板 (Typst Slide Theme for SJTU Based on Touying)☆24Jan 27, 2026Updated 3 months ago
- An experimental tool to modify YAMLs without losing (most of) comment lines.☆16Sep 25, 2022Updated 3 years ago
- Asynchronous pipeline parallel optimization☆21Feb 2, 2026Updated 3 months ago
- An experimental implementation of 'try' operator for Go☆13Jun 13, 2019Updated 6 years ago
- Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serv…☆298May 14, 2026Updated last week
- ☆19May 31, 2018Updated 7 years ago
- Visualize your GitHub relationship using GitHub API v3 in JavaScript, Mathematica, Python or Scala.☆11Jul 15, 2015Updated 10 years ago
- NVIDIA device plugin for Kubernetes☆15Sep 9, 2019Updated 6 years ago
- engula-operator creates/configures/manages engula clusters atop Kubernetes☆12Jan 5, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- my bachelor's thesis in SJTU about https://github.com/caicloud/cyclone☆12Jan 4, 2018Updated 8 years ago
- ☆110May 10, 2026Updated 2 weeks ago
- Writing a CUDA software ray tracing renderer with Analysis-Driven Optimization from scratch: a python-importable, distributed parallel re…☆37Apr 12, 2026Updated last month
- 滴滴云推理服务的 HTTP 客户端示例代码☆21Nov 21, 2022Updated 3 years ago
- Persistent Kernel + JIT-Injected Operators (CUDA)☆47Jan 27, 2026Updated 3 months ago
- ☆36Apr 30, 2026Updated 3 weeks ago
- [ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding☆148Dec 4, 2024Updated last year
- ☆105May 31, 2025Updated 11 months ago
- flex-block-attn: an efficient block sparse attention computation library☆131Dec 26, 2025Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆27Apr 24, 2025Updated last year
- 高性能计算实验室文档模板☆14Aug 11, 2017Updated 8 years ago
- 🌈 Custom color scheme for github contribution bar. Created by @gigaflw☆34Nov 14, 2018Updated 7 years ago
- Unofficial GitLab Android client. Support self hosted GitLab and Push notifications☆10May 18, 2016Updated 10 years ago
- Test Orchestrator for Performance and Scalability of AI pLatforms☆18May 11, 2026Updated 2 weeks ago
- DeeperGEMM: crazy optimized version☆86May 5, 2025Updated last year
- Website for CSE 234, Winter 2025☆15Mar 24, 2025Updated last year
- ☆10Nov 18, 2024Updated last year
- PiKV: KV Cache Management System for Mixture of Experts [Efficient ML System]☆52Updated this week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Personal Blog in github.io☆10Feb 25, 2026Updated 3 months ago
- triton for dsa☆63May 12, 2026Updated last week
- 在 Telegram 上快速模仿迟先生卖弱。☆16May 11, 2026Updated last week
- A collection of useful Go libraries to ease the development of NVIDIA Operators for GPU/NIC management.☆30May 18, 2026Updated last week
- Dongyue Web Studio course and lecture☆12Apr 25, 2018Updated 8 years ago
- 🎓Automatically Update LLM inference systems Papers Daily using Github Actions (Update Every 12th hours)☆12May 18, 2026Updated last week
- rpc_learn Spring + Netty + Protostuff + ZooKeeper 实现了一个轻量级 RPC 框架,使用 Spring 提供依赖注入与参数配置,使用 Netty 实现 NIO 方式的数据传输,使用 Protostuff 实现对象序列化,使用 …☆19May 26, 2015Updated 10 years ago