modelscope / evalscope
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
☆792Updated this week
Alternatives and similar repositories for evalscope:
Users that are interested in evalscope are comparing it to the libraries listed below
- Community maintained hardware plugin for vLLM on Ascend☆459Updated this week
- 通义千问VLLM推理部署DEMO☆562Updated last year
- ☆843Updated 2 weeks ago
- Train a 1B LLM with 1T tokens from scratch by personal☆604Updated last month
- The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.☆984Updated this week
- Distributed RL System for LLM Reasoning☆1,079Updated last week
- 这是一个从头训练大语言模型的项目,包括预训练、微调和直接偏好优化,模型拥有1B参数,支持中英文。☆350Updated last month
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆419Updated 2 weeks ago
- Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker.☆788Updated this week
- 开源SFT数据集整理,随时补充☆506Updated last year
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL☆1,968Updated this week
- ☆643Updated this week
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆312Updated 8 months ago
- 大模型多维度中文对齐评测基准 (ACL 2024)☆375Updated 8 months ago
- Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.☆548Updated 9 months ago
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.☆689Updated 2 months ago
- LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案☆514Updated last year
- ☆319Updated 10 months ago
- CMMLU: Measuring massive multitask language understanding in Chinese☆751Updated 4 months ago
- LongBench v2 and LongBench (ACL 2024)☆836Updated 3 months ago
- CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models☆295Updated 5 months ago
- Netease Youdao's open-source embedding and reranker models for RAG products.☆1,704Updated 2 months ago
- 从零实现一个小参数量中文大语言模型。☆593Updated 7 months ago
- Awesome-RAG: Collect typical RAG papers and systems.☆352Updated 2 months ago
- LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.☆706Updated this week
- DeepSeek 系列工作解读、扩展和复现。☆622Updated 2 weeks ago
- 本项目旨在收集开源的表格智能任务数据集(比如表格问答、表格-文本生成等),将原始数据整理为指令微调格式的数据并微调LLM,进而增强LLM对于表格数据的理解,最终构建出专门面向表格智能任务的大型语言模型。☆562Updated 11 months ago
- Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs…☆513Updated 5 months ago
- High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.☆1,086Updated this week
- LLM Inference benchmark☆406Updated 8 months ago