modelscope / evalscopeLinks
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
☆1,038Updated this week
Alternatives and similar repositories for evalscope
Users that are interested in evalscope are comparing it to the libraries listed below
Sorting:
- Community maintained hardware plugin for vLLM on Ascend☆703Updated this week
- ☆931Updated 3 weeks ago
- Train a 1B LLM with 1T tokens from scratch by personal☆663Updated last month
- Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker.☆900Updated this week
- 通义千问VLLM推理部署DEMO☆580Updated last year
- 这是一个从头训练大语言模型的项目,包括预训练、微调和直接偏好优化,模型拥有1B参数,支持中英文。☆408Updated 3 months ago
- Distributed RL System for LLM Reasoning☆1,284Updated this week
- Netease Youdao's open-source embedding and reranker models for RAG products.☆1,760Updated 3 months ago
- The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.☆1,106Updated this week
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL☆2,515Updated this week
- 开源SFT数据集整理,随时补充☆516Updated 2 years ago
- Reproduce R1 Zero on Logic Puzzle☆2,347Updated 2 months ago
- CMMLU: Measuring massive multitask language understanding in Chinese☆763Updated 5 months ago
- High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.☆1,129Updated this week
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆435Updated last month
- Easy-to-Use RAG Framework; CCF AIOps International Challenge 2024 Top3 Solution; CCF AIOps 国际挑战赛 2024 季军方案☆482Updated 6 months ago
- ☆328Updated 11 months ago
- ☆706Updated this week
- Awesome-RAG: Collect typical RAG papers and systems.☆381Updated 4 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆327Updated last month
- 欢迎来到 LLM-Dojo,这里是一个开源大模型学习场所,使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩🎓👨🎓☆756Updated 2 weeks ago
- Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs…☆534Updated 7 months ago
- Build & Optimize your RAG.☆671Updated 2 weeks ago
- LongBench v2 and LongBench (ACL 2024)☆883Updated 4 months ago
- ☆939Updated 3 months ago
- DeepSeek 系列工作解读、扩展和复现。☆652Updated 2 months ago
- 大模型多维度中文对齐评测基准 (ACL 2024)☆389Updated 9 months ago
- 本项目旨在收集开源的表格智能任务数据集(比如表格问答、表格-文本生成等),将原始数据整理为指令微调格式的数据并微调LLM,进而增强LLM对于表格数据的理解,最终构建出专门面向表格智能任务的大型语言模型。☆585Updated last year
- 中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。☆1,539Updated last year
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆325Updated 10 months ago