malody2014 / llm_benchmarkLinks

☆572

Alternatives and similar repositories for llm_benchmark

Users that are interested in llm_benchmark are comparing it to the libraries listed below

Sorting:

KCORES / kcores-llm-arena
LLM Arena by KCORES team
☆946Updated 5 months ago
thunlp / LLMxMapReduce
☆815Updated last month
FunnySaltyFish / Better-Ruozhiba
【逐条处理完成】人为审核+修改每一条的弱智吧精选问题QA数据集
☆230Updated 6 months ago
bytedance / SandboxFusion
☆665Updated 3 months ago
LSTM-Kirigaya / openmcp-client
All in one vscode plugin for mcp developer
☆524Updated 2 weeks ago
ScienceOne-AI / DeepSeek-671B-SFT-Guide
An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to infer…
☆765Updated 7 months ago
thu-pacman / chitu
High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
☆1,296Updated this week
modelscope / evalscope
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
☆1,807Updated this week
Leymore / ruozhiba
☆720Updated 2 years ago
knemik97 / Manifesto-against-the-Plagiarist-Yunhe-Wang
讨贼王云鹤檄文
☆1,094Updated 3 months ago
datawhalechina / unlock-deepseek
DeepSeek 系列工作解读、扩展和复现。
☆680Updated 6 months ago
vllm-project / vllm-ascend
Community maintained hardware plugin for vLLM on Ascend
☆1,198Updated this week
zhanshijinwat / Steel-LLM
Train a 1B LLM with 1T tokens from scratch by personal
☆740Updated 5 months ago
kaixindelele / chinarxiv
将gpt_academic的arxiv论文翻译单独抽取出来，更方便部署和集成arxiv论文翻译
☆94Updated last month
bojone / papers.cool
Cool Papers - Immersive Paper Discovery
☆627Updated last month
allwefantasy / auto-coder
☆1,179Updated 3 months ago
qiufengqijun / mini_qwen
这是一个从头训练大语言模型的项目，包括预训练、微调和直接偏好优化，模型拥有1B参数，支持中英文。
☆635Updated 7 months ago
Qihoo360 / Light-R1
☆745Updated last month
padeoe / hf-mirror-site
a huggingface mirror site.
☆305Updated last year
OpenLMLab / GAOKAO-Bench
GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.
☆686Updated 9 months ago
inclusionAI / AReaL
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
☆2,791Updated this week
open-compass / GAOKAO-Eval
☆109Updated last week
intro-llm / intro-llm.github.io
website
☆448Updated 7 months ago
wxywb / history_rag
☆1,032Updated last year
shareAI-lab / Kode
Like Claude Code, but Koding with DeepSeek V3.1, Kimi2, GLM4.5, Qwen Coder etc.（welcome to use Kode to summit PR)
☆3,099Updated last week
brillm05 / BriLLM0.5
☆292Updated 2 months ago
monster119120 / Industrial_LLM_tutorial
☆138Updated 2 months ago
kaixindelele / 2025-Awesome-AI-Bloggers
全网最全-2025年AI领域最值得关注的两百位博主和一手信息源盘点
☆165Updated 9 months ago
HarderThenHarder / RLLoggingBoard
A visuailzation tool to make deep understaning and easier debugging for RLHF training.
☆256Updated 7 months ago
meituan-longcat / LongCat-Flash-Chat
☆1,148Updated 2 weeks ago