modelscope / evalscope
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
☆1,827Updated this week
Alternatives and similar repositories for evalscope
Users that are interested in evalscope are comparing it to the libraries listed below
- Community maintained hardware plugin for vLLM on Ascend☆1,230Updated last week
- ☆1,097Updated last month
- Tongyi Qianwen (Qwen) vLLM inference deployment demo☆614Updated last year
- Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.☆2,853Updated last week
- Train a 1B LLM on 1T tokens from scratch as an individual developer☆741Updated 6 months ago
- Reproduce R1 Zero on Logic Puzzle☆2,407Updated 7 months ago
- High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.☆1,307Updated last week
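- A project for training a large language model from scratch, covering pre-training, fine-tuning, and direct preference optimization; the model has 1B parameters and supports Chinese and English.☆645Updated 8 months ago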
- Netease Youdao's open-source embedding and reranker models for RAG products.☆1,840Updated last month
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, …☆6,204Updated this week
- The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.☆1,395Updated this week
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL☆3,836Updated last week
- An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models☆2,083Updated last week
- Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker.☆1,047Updated 3 months ago
- CMMLU: Measuring massive multitask language understanding in Chinese☆790Updated 10 months ago
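- Welcome to LLM-Dojo, an open-source place for learning about large models. It uses concise, readable code to build a model training framework (supporting mainstream models such as Qwen, Llama, and GLM), an RLHF framework (DPO/CPO/KTO/PPO), and other features. 👩🎓👨🎓☆885Updated last week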
- An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to infer…☆764Updated 7 months ago
- Explanations, extensions, and reproductions of the DeepSeek series of work.☆681Updated 6 months ago
- ☆748Updated last month
- A personal repository for experimenting with and reproducing the LLM pre-training process.☆473Updated 5 months ago
- Easy-to-Use RAG Framework; CCF AIOps International Challenge 2024 Top3 Solution (third-place entry)☆587Updated 11 months ago
- ☆1,498Updated 3 weeks ago
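- LLM concurrency performance testing tool, supporting automated stress testing and performance report generation.☆167Updated 7 months ago
- A small 0.2B Chinese dialogue model (ChatLM-Chinese-0.2B), with all code open-sourced for the full pipeline: dataset sources, data cleaning, tokenizer training, model pre-training, SFT instruction fine-tuning, and RLHF optimization. Supports SFT fine-tuning for downstream tasks, with a fine-tuning example for triple information extraction.☆1,623Updated last year
- Official GitHub repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]☆1,778Updated 3 months ago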
- The Open-Source Data Annotation Platform☆945Updated 8 months ago
- Awesome-RAG: Collect typical RAG papers and systems.☆435Updated 2 months ago
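- LLMs interview notes and answers: this repository mainly records interview questions and reference answers for large language model (LLM) algorithm engineers☆601Updated 2 years ago
- A curated collection of open-source SFT datasets, updated continually☆547Updated 2 years ago
- This project aims to collect open-source datasets for table intelligence tasks (such as table question answering and table-to-text generation), convert the raw data into instruction fine-tuning format, and fine-tune LLMs on it to strengthen their understanding of tabular data, ultimately building a large language model specialized for table intelligence tasks.☆622Updated last year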