llm2014 / llm_benchmark
☆741Updated last week
Alternatives and similar repositories for llm_benchmark
Users that are interested in llm_benchmark are comparing it to the libraries listed below
- LLM Arena by KCORES team☆962Updated 8 months ago
- ☆847Updated 2 months ago
- All in one vscode plugin for mcp developer☆696Updated 2 months ago
- ☆851Updated 3 weeks ago
- ☆741Updated 2 years ago
- An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to infer…☆791Updated 9 months ago
- [Item-by-item processing complete] A QA dataset of selected Ruozhiba questions, with every entry manually reviewed and revised☆240Updated 8 months ago
- ☆757Updated 2 weeks ago
- ☆1,204Updated 6 months ago
- A minimal yet professional single agent demo project that showcases the core execution pipeline and production-grade features of agents.☆1,080Updated this week
- Train a 1B LLM with 1T tokens from scratch as an individual☆782Updated 8 months ago
- A lightweight multilingual LLM☆1,012Updated 5 months ago
- Interpretation, extension, and reproduction of the DeepSeek series of work.☆699Updated 9 months ago
- ☆1,260Updated 3 weeks ago
- ☆816Updated 7 months ago
- CMMLU: Measuring massive multitask language understanding in Chinese☆798Updated last year
- A Hugging Face mirror site.☆322Updated last year
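- The most comprehensive roundup on the web: the 200 bloggers and first-hand information sources most worth following in AI in 2025☆195Updated last year
- Uses Deepseek to give any model multimodal, web-access, and strong reasoning capabilities.☆209Updated 10 months ago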
- website☆461Updated 10 months ago
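- A project for training a large language model from scratch, including pre-training, fine-tuning, and direct preference optimization; the model has 1B parameters and supports Chinese and English.☆716Updated 10 months ago
- A proclamation denouncing the villain Wang Yunhe☆1,097Updated 6 months ago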
- Cool Papers - Immersive Paper Discovery☆680Updated 4 months ago
- GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.☆698Updated last year
- Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.☆3,340Updated this week
- One-click proxy script for the Antigravity Agent, supporting WSL and remote SSH☆154Updated last week
- The official repository of the dots.llm1 base and instruct models proposed by rednote-hilab.☆476Updated 4 months ago
- Build, evaluate and train General Multi-Agent Assistance with ease☆1,085Updated this week
- TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles.☆162Updated last year
- Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sour…☆1,463Updated 10 months ago