llm2014/llm_benchmark

llm2014 / llm_benchmark

☆1,455

Alternatives and similar repositories for llm_benchmark

Users that are interested in llm_benchmark are comparing it to the libraries listed below.

KCORES / kcores-llm-arena
LLM Arena by KCORES team
☆953 · Apr 29, 2025 · Updated last year
victorchen96 / deepseek_v4_rolepaly_instruct
An explanation of the special control instructions for DeepSeek-V4 role-play
☆2,206 · May 28, 2026 · Updated last month
CherryHQ / cherry-studio
AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs
☆48,843 · Updated this week
rikkahub / rikkahub
RikkaHub is an Android app that supports multiple LLM providers.
☆6,215 · Updated this week
CherryHQ / cherry-studio-app
🍒 This is the mobile version of Cherry Studio.
☆3,594 · Updated this week
router-for-me / CLIProxyAPI
Wrap Antigravity, ChatGPT Codex, Claude Code, Grok Build as an OpenAI/Gemini/Claude/Codex compatible API service, allowing you to enjoy t…
☆44,056 · Updated this week
jeinlee1991 / chinese-llm-benchmark
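NoneLinear (非线智能) - ReLE evaluation: a capability benchmark for Chinese AI large language models (continuously updated). It currently covers 374 large models, including chatgpt, gpt-5.4, Google gemini-3.1-pro, Claude-4.6, Wenxin ERNIE-X1.1, ERNIE-5.0, qwen3.6-max, …
☆6,302 · Updated this week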
QuantumNous / new-api
A unified AI model hub for aggregation & distribution. It supports cross-converting various LLMs into OpenAI-compatible, Claude-compatibl…
☆42,955 · Updated this week
looplj / axonhub
⚡️ Open-source AI Gateway — Use any SDK to call 100+ LLMs. Built-in failover, load balancing, cost control & end-to-end tracing.
☆4,752 · Updated this week
ding113 / claude-code-hub
A modern Claude Code & Codex API proxy service that provides intelligent load balancing, user management, and usage statistics.
☆3,265 · Updated this week
farion1231 / cc-switch
A cross-platform desktop All-in-One assistant for Claude Code, Codex, OpenCode, OpenClaw, Gemini CLI & Hermes Agent. Only official websit…
☆119,703 · Updated this week
mengxi-ream / read-frog
🐸 Read Frog (陪读蛙) - Open Source Immersive Translate
☆8,619 · Updated this week
musistudio / claude-code-router
One local control plane for every AI agent: route across models, fuse new capabilities, orchestrate tools, and stay fully in control.
☆36,042 · Updated this week
clash-verge-rev / clash-verge-rev
A modern GUI client based on Tauri, designed to run on Windows, macOS and Linux for a tailored proxy experience
☆132,904 · Updated this week
opendatalab / MinerU
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
☆75,311 · Updated this week
bestruirui / octopus
One Hub All LLMs For You | An LLM API aggregation gateway built for individuals
☆2,296 · May 28, 2026 · Updated last month
Chevey339 / kelivo
A Flutter LLM Chat Client. Support Mobile & Desktop.
☆3,322 · Updated this week
CLUEbenchmark / SuperCLUE
SuperCLUE: A comprehensive benchmark for Chinese general-purpose large models | A Benchmark for Foundation Models in Chinese
☆3,297 · Feb 6, 2026 · Updated 5 months ago
GuDaStudio / GrokSearch
Integrate Grok's powerful real-time search capabilities into Claude via the MCP protocol!
☆1,825 · Mar 9, 2026 · Updated 4 months ago
OpenRLHF / OpenRLHF
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asy…
☆9,831 · Jul 14, 2026 · Updated last week
AstrBotDevs / AstrBot
AI Agent Assistant & development framework that integrates lots of IM platforms, LLMs, plugins and AI features, and can be your openclaw a…
☆37,453 · Updated this week
stepfun-ai / SteptronOss
A lightweight, AI-native training framework for large language models. Designed for fast iteration, reproducible experiments, and modular…
☆576 · May 18, 2026 · Updated 2 months ago
SillyTavern / SillyTavern
LLM Frontend for Power Users.
☆30,971 · Jul 11, 2026 · Updated last week
mindfold-ai / Trellis
The best agent harness.
☆12,957 · Updated this week
THUDM / slime
slime is an LLM post-training framework for RL Scaling.
☆7,569 · Updated this week
verl-project / verl
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
☆22,587 · Updated this week
modelscope / ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL…
☆14,887 · Updated this week
tbphp / gpt-load
Multi-channel AI proxy with intelligent key rotation.
☆6,254 · Updated this week
anomalyco / opencode
The open source coding agent.
☆188,231 · Updated this week
fishjar / kiss-translator
A simple, open source bilingual translation extension & Greasemonkey script
☆11,423 · Updated this week
QwenLM / Qwen3
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
☆27,408 · Jan 9, 2026 · Updated 6 months ago
wanxiaoT / rikkahubx
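[This is not a version supported by RikkaHub!] RikkaHubX is an independent fork of RikkaHub that adds some modified features on top of RikkaHub and will not submit PRs to the original project.
☆20 · Dec 26, 2025 · Updated 6 months ago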
yeahhe365 / Dual-AI-Chat
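An advanced chat application that demonstrates a distinctive conversational paradigm: the user's query is first debated and refined by two different AI personas before a final synthesized answer is provided. The project uses the Google Gemini API to drive a logical AI (Cognito) and a skeptical AI (Muse), which collaborate to produce more robust, accurate…
☆350 · Updated this week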
code-yeongyu / oh-my-openagent
omo/lazycodex: The coding agent for tokenmaxxers; the one and only agent harness for complex codebases. For your Codex, for your OpenCode
☆66,329 · Updated this week
Wei-Shaw / sub2api
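Sub2API is a one-stop open-source relay service that provides unified access to Claude, Openai, Gemini, and Grok subscriptions, supports shared ("carpool") usage to split costs more efficiently, and works seamlessly with native tools.
☆33,331 · Updated this week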
ztx888 / HaloWebUI
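Based on the official OpenWebUI, with a Chinese-localized interface for a better Chinese-language experience, plus added model billing and usage statistics.
☆1,165 · Jun 4, 2026 · Updated last month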
UfoMiao / zcf
Zero-Config Code Flow for Claude Code & Codex
☆6,077 · Updated this week
Lingyan000 / fluxdo
A third-party client for Linux.do
☆2,036 · Updated this week
vectara / hallucination-leaderboard
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
☆3,288 · May 11, 2026 · Updated 2 months ago