llm2014 / llm_benchmark
☆764Updated last week
Alternatives and similar repositories for llm_benchmark
Users that are interested in llm_benchmark are comparing it to the libraries listed below
- LLM Arena by KCORES team☆961Updated 9 months ago
- ☆857Updated 3 months ago
- All-in-one VS Code plugin for MCP developers☆719Updated 3 weeks ago
- An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to infer…☆794Updated 10 months ago
- ☆919Updated last month
- ☆745Updated 2 years ago
- [Item-by-item processing complete] A QA dataset of selected Ruozhiba questions, with every entry manually reviewed and revised☆243Updated 9 months ago
- GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.☆703Updated last year
- A lightweight multilingual LLM☆1,012Updated 5 months ago
- OpenJudge: A Unified Framework for Holistic Evaluation and Quality Rewards☆355Updated this week
- A Hugging Face mirror site.☆326Updated last year
- A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.☆2,360Updated last week
- ☆761Updated last month
- An LLM-optimized Pinyin input method☆465Updated this week
- ☆1,212Updated 7 months ago
- Converts Zhihu column articles to Markdown files and saves them locally☆554Updated 10 months ago
- Train a 1B LLM with 1T tokens from scratch as an individual☆787Updated 9 months ago
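- Extracts the arXiv paper translation feature of gpt_academic into a standalone tool and integrates BabelOCR; supports local PDF translation and raises the translation success rate to 95%+☆140Updated 2 months ago
- Interpretation, extension, and reproduction of the DeepSeek series of work.☆699Updated 10 months ago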
- Kode CLI — Design for post-human workflows. One unit agent for every human & computer task.☆4,271Updated 2 weeks ago
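- The most comprehensive roundup on the web: the 200 bloggers and first-hand information sources most worth following in AI in 2025☆205Updated last year
- A proclamation denouncing the traitor Wang Yunhe☆1,102Updated 6 months ago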
- Cool Papers - Immersive Paper Discovery☆701Updated 5 months ago
- The official repository of the dots.llm1 base and instruct models proposed by rednote-hilab.☆487Updated 5 months ago
- ☆1,300Updated this week
- CMMLU: Measuring massive multitask language understanding in Chinese☆801Updated last year
- PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents that fully preserves layout; supports Google/DeepL/Ollama/OpenAI and other services; provides CLI/GUI/Docker☆2,217Updated 2 weeks ago
- TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles.☆163Updated last year
- Official GitHub repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]☆1,807Updated 6 months ago
- LLM101n: Let's build a Storyteller (Chinese edition)☆137Updated last year