modelscope / evalscope
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
☆1,173Updated this week
Alternatives and similar repositories for evalscope
Users that are interested in evalscope are comparing it to the libraries listed below
- Community maintained hardware plugin for vLLM on Ascend☆773Updated this week
- Train a 1B LLM with 1T tokens from scratch as an individual☆679Updated last month
- The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.☆1,151Updated last week
- Tongyi Qianwen (Qwen) vLLM inference deployment demo☆581Updated last year
- Distributed RL System for LLM Reasoning☆1,774Updated this week
- ☆954Updated last month
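- A project for training a large language model from scratch, covering pre-training, fine-tuning, and direct preference optimization; the model has 1B parameters and supports Chinese and English.☆428Updated 4 months ago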
- Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker.☆937Updated 2 weeks ago
- High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.☆1,144Updated this week
- Netease Youdao's open-source embedding and reranker models for RAG products.☆1,776Updated 4 months ago
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL☆2,691Updated this week
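- Welcome to LLM-Dojo, an open-source place for learning about large models. It uses concise, readable code to build a model training framework (supporting mainstream models such as Qwen, Llama, and GLM), an RLHF framework (DPO/CPO/KTO/PPO), and other features. 👩‍🎓👨‍🎓☆778Updated 2 weeks ago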
- ☆714Updated 3 weeks ago
- Reproduce R1 Zero on Logic Puzzle☆2,355Updated 3 months ago
- Build & Optimize your RAG.☆698Updated last month
- CMMLU: Measuring massive multitask language understanding in Chinese☆765Updated 6 months ago
- A repository for individuals to experiment with and reproduce the LLM pre-training process.☆441Updated last month
- A multi-dimensional Chinese alignment evaluation benchmark for large models (ACL 2024)☆392Updated 10 months ago
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.☆801Updated 2 weeks ago
- Easy-to-Use RAG Framework; CCF AIOps International Challenge 2024 Top3 Solution (third place)☆517Updated 7 months ago
- Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs…☆541Updated 7 months ago
- ☆784Updated last week
- Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.☆3,436Updated this week
- A high-performance deep learning training platform with task-level time-sharing scheduling of GPU compute☆651Updated last year
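- This project aims to collect open-source datasets for table intelligence tasks (such as table question answering and table-to-text generation), convert the raw data into instruction-tuning format, and fine-tune LLMs on it to strengthen their understanding of tabular data, ultimately building a large language model specialized for table intelligence tasks.☆595Updated last year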
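- Welcome to the "LLM-travel" repository! Explore the mysteries of large language models (LLMs) 🚀. Dedicated to understanding, discussing, and implementing the techniques, principles, and applications related to large models.☆328Updated 11 months ago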
- TinyRAG☆305Updated this week
- LLMs interview notes and answers: this repository mainly records interview questions and reference answers for large language model (LLM) algorithm engineers☆562Updated last year
- Minimal-cost training of a 0.5B R1-Zero☆742Updated last month
- Interpretation, extension, and reproduction of the DeepSeek series of work.☆657Updated 2 months ago