modelscope / evalscope
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
☆2,046 Updated last week
Alternatives and similar repositories for evalscope
Users who are interested in evalscope are comparing it to the repositories listed below.
- Community maintained hardware plugin for vLLM on Ascend ☆1,443 Updated this week
- ☆1,185 Updated 2 months ago
- Tongyi Qianwen (Qwen) vLLM inference deployment demo ☆627 Updated last year
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, … ☆6,391 Updated this week
- Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible. ☆3,136 Updated this week
- High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability. ☆1,347 Updated this week
- The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud. ☆1,457 Updated 3 weeks ago
- Reproduce R1 Zero on Logic Puzzle ☆2,416 Updated 8 months ago
- Netease Youdao's open-source embedding and reranker models for RAG products. ☆1,849 Updated 3 months ago
- Train a 1B LLM on 1T tokens from scratch as an individual ☆759 Updated 7 months ago
- Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker. ☆1,066 Updated 5 months ago
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL ☆4,204 Updated this week
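- A project for training a large language model from scratch, covering pre-training, fine-tuning, and direct preference optimization (DPO). The model has 1B parameters and supports Chinese and English. ☆690 Updated 9 months ago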
- CMMLU: Measuring massive multitask language understanding in Chinese ☆795 Updated last year
- An LLM concurrency performance testing tool that supports automated stress testing and performance report generation. ☆191 Updated 8 months ago
- ☆1,616 Updated 2 months ago
- An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models ☆2,448 Updated this week
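- Welcome to LLM-Dojo, an open-source place to learn about large models. It uses concise, readable code to build a model training framework (supporting mainstream models such as Qwen, Llama, and GLM), RLHF frameworks (DPO/CPO/KTO/PPO), and other features. 👩‍🎓👨‍🎓 ☆901 Updated last week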
- Official GitHub repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023] ☆1,789 Updated 4 months ago
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs. ☆7,357 Updated this week
- Easy-to-Use RAG Framework; top-3 (third-place) solution in the CCF AIOps International Challenge 2024 ☆604 Updated last year
- Interpretations, extensions, and reproductions of the DeepSeek series of work. ☆690 Updated 8 months ago
- Resources on LLM-based applications built with the RAG pattern ☆1,593 Updated last month
- An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to infer… ☆780 Updated 8 months ago
- Awesome-RAG: Collect typical RAG papers and systems. ☆444 Updated 4 months ago
- Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI. ☆4,357 Updated last week
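- A small 0.2B Chinese chat model (ChatLM-Chinese-0.2B), with all code open-sourced for the full pipeline: dataset sources, data cleaning, tokenizer training, model pre-training, SFT instruction fine-tuning, and RLHF optimization. Supports SFT fine-tuning for downstream tasks and includes a fine-tuning example for triple information extraction. ☆1,645 Updated last year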
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (… ☆11,418 Updated last week
- A repository for individuals to experiment with and reproduce the LLM pre-training process. ☆480 Updated 7 months ago
- LLM notes, including model inference, transformer model structure, and LLM framework code analysis notes. ☆854 Updated 2 months ago