modelscope / evalscopeLinks
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
☆1,398Updated this week
Alternatives and similar repositories for evalscope
Users that are interested in evalscope are comparing it to the libraries listed below
Sorting:
- Community maintained hardware plugin for vLLM on Ascend☆926Updated this week
- ☆999Updated this week
- 通义千问VLLM推理部署DEMO☆592Updated last year
- The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.☆1,258Updated 3 weeks ago
- Train a 1B LLM with 1T tokens from scratch by personal☆703Updated 3 months ago
- 这是一个从头训练大语言模型的项目,包括预训练、微调和直接偏好优化,模型拥有1B参数,支持中英文。☆522Updated 5 months ago
- Distributed RL System for LLM Reasoning☆2,090Updated this week
- Netease Youdao's open-source embedding and reranker models for RAG products.☆1,813Updated 3 weeks ago
- High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.☆1,173Updated last week
- Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker.☆985Updated 3 weeks ago
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …☆5,782Updated this week
- CMMLU: Measuring massive multitask language understanding in Chinese☆774Updated 7 months ago
- Reproduce R1 Zero on Logic Puzzle☆2,380Updated 4 months ago
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL☆3,166Updated last week
- LLM 并发性能测试工具,支持自动化压力测试和性能报告生成。☆131Updated 4 months ago
- 本项目旨在收集开源的表格智能任务数据集(比如表格问答、表格-文本生成等),将原始数据整理为指令微调格式的数据并微调LLM,进而增强LLM对于表格数据的理解,最终构建出专门面向表格智能任务的大型语言模型。☆605Updated last year
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆458Updated 3 months ago
- ☆734Updated 2 months ago
- Easy-to-Use RAG Framework; CCF AIOps International Challenge 2024 Top3 Solution; CCF AIOps 国际挑战赛 2024 季军方案☆546Updated 8 months ago
- 中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。☆1,570Updated last year
- ☆348Updated last year
- 开源SFT数据集整理,随时补充☆529Updated 2 years ago
- Build & Optimize your RAG.☆727Updated 2 months ago
- An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to infer…☆733Updated 4 months ago
- Awesome-RAG: Collect typical RAG papers and systems.☆403Updated 6 months ago
- The Open-Source Data Annotation Platform☆888Updated 5 months ago
- A pre-built agent for TableGPT2.☆600Updated 3 weeks ago
- DeepSeek 系列工作解读、扩展和复现。☆665Updated 4 months ago
- Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]☆1,760Updated this week
- ☆170Updated this week