justplus/llm-eval

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/justplus/llm-eval)

justplus / llm-eval

大语言模型评估平台，支持多种评估基准、自定义数据集和性能测试。支持基于自定义数据集的RAG评估。

☆98

Alternatives and similar repositories for llm-eval

Users that are interested in llm-eval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

modelscope / evalscope
View on GitHub
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
☆3,163Updated this week
ryonWang / creative-ai-front
View on GitHub
创意宝是一款融合了前沿人工智能技术的多功能应用，以创新为核心，为用户打造一个集时尚、智能、创意于一体的数字化平台。通过强大的 AI 技术，实现了 AI 试衣、数字人互动、智能短剧编写等特色功能，同时涵盖了丰富的语音、话术及商品管理等实用工具，旨在满足用户在时尚穿搭、内容创作…
☆22Feb 17, 2025Updated last year
BytePioneer-AI / RAGEval
View on GitHub
灵鉴 RAG评测系统 | RAGEval ✨ 开箱即用的RAG系统自动化评估工具 | One-stop RAG Evaluation Solution
☆104Apr 16, 2026Updated 3 months ago
spitfireuptown / datacopilotx
View on GitHub
智能问数系统
☆30Updated this week
lwdgit / web-clipper
View on GitHub
网页剪报
☆12Jan 30, 2016Updated 10 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
BUAADreamer / Qwen2-VL-History
View on GitHub
Qwen2-VL在文旅领域的LLaMA-Factory微调案例 The case for fine-tuning Qwen2-VL in the field of historical literature and museums
☆15Sep 17, 2024Updated last year
domonic18 / ai-eval-system
View on GitHub
这是一个基于OpenCompass的模型评测系统，该系统提供了前端页面UI以方便用户自助开展评测工作。
☆28Aug 25, 2025Updated 11 months ago
Guiwith / ModelVerse
View on GitHub
ModelVerse是一个功能强大的大语言模型(LLM)一体化推训平台，致力于为AI开发者和研究者提供完整的模型生命周期管理解决方案。从模型管理到推理部署，从训练微调到性能评估，ModelVerse将复杂的AI工作流程简化为直观易用的一体化平台。
☆44Updated this week
FlyAIBox / llm_benchmark
View on GitHub
大模型推理压测
☆50Jul 31, 2025Updated 11 months ago
OpenDCAI / DataFlow-WebUI
View on GitHub
An NL2Pipeline Harness for building AI-ready data workflows.
☆109Jul 23, 2026Updated last week
fufankeji / fufan-chat-web
View on GitHub
fufan-chat-api的前端项目
☆27Nov 1, 2024Updated last year
bigprime0 / bigprime-dgp
View on GitHub
bigprimeDGP是以数据驱动为核心优势的大数据数智平台。借助先进的插件化架构和灵活的配置,能够快速响应各种复杂的数据业务需求,为企业的数据治理之路提供技术支撑。它集成了数据集成,数据仓库,计算平台,任务调度,服务编排,数据治理,工作流引擎等技术,致力于通过高效的数据整…
☆16Updated this week
traceless / awkward-proxy
View on GitHub
在内网中使用代理上互联网
☆10Mar 7, 2023Updated 3 years ago
open-compass / CompassJudger
View on GitHub
The All-in-one Judge Models introduced by Opencompass
☆120Jul 15, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
0FuzzingQ / CodeScanner
View on GitHub
A code security platform based on fortify sca windows
☆15Mar 6, 2019Updated 7 years ago
KafCoppelia / Free_Courses_Notes
View on GitHub
智能计算系统陈云霁、数值计算王兵团笔记
☆11Oct 14, 2022Updated 3 years ago
Potterluo / Dream-Interviewer
View on GitHub
在使用AI工具进行深入分析面试岗位需求与应聘者简历的基础上，依据既定标准向应聘者提出针对性问题。在应聘者完成问题解答后，将对简历内容及面试表现进行全面评估，并提供具体的改进建议，以助其提升职业竞争力。
☆17Mar 19, 2024Updated 2 years ago
hengboy / mybatis-pageable
View on GitHub
MyBatis-Pageable是一款自动化分页的插件，基于MyBatis内部的插件Interceptor拦截器编写完成，拦截Executor.query的两个重载方法计算出分页的信息以及根据配置的数据库Dialect自动执行不同的查询语句完成总数量的统计。
☆14Mar 18, 2019Updated 7 years ago
dingzhaowei / TestMP
View on GitHub
Test Management Platform for Automation
☆16Aug 24, 2015Updated 10 years ago
jsfsds / pubmed_search
View on GitHub
一个基于模型上下文协议/MCP 构建的智能医学文献分析工具。它旨在帮助科研人员、医学从业者和学生快速检索 PubMed 数据库，并利用大型语言模型 (LLM) 的能力对文献摘要进行智能分析和总结
☆10Jun 17, 2026Updated last month
StarWalkin / UI-NEXUS
View on GitHub
This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…
☆14Jul 27, 2025Updated last year
Neotys-Labs / helm-neoload-web
View on GitHub
A helm chart for deploying Neoload Web on your Kubernetes cluster
☆13Jul 1, 2026Updated 3 weeks ago
huangzworks / rediscookbook
View on GitHub
☆23Oct 28, 2025Updated 9 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
test1213145 / wechat-export
View on GitHub
获取微信聊天记录数据库密钥，各版本通用。
☆14Aug 25, 2022Updated 3 years ago
GreyHy / CampusBBS-SpringBoot-
View on GitHub
基于Springboot开发，实现智能化的校园交流论坛，功能类似于掘金。
☆13Nov 27, 2024Updated last year
fs714 / iperf-gui
View on GitHub
Web Based Iperf Result Real-time Visualization
☆17Apr 26, 2019Updated 7 years ago
felixhpp / kettle-web
View on GitHub
kettle 的web管理工具
☆17Jan 8, 2019Updated 7 years ago
milkoloa / AIBS
View on GitHub
一款自动化写标书的后端代码，开源免费使用
☆35Jun 11, 2025Updated last year
smooth00 / stressTestSystem
View on GitHub
基于Jmeter实现的在线压测平台，在原有版本基础上进行一些个性化的功能添加；本系统在zyanycall/stressTestPlatform的开源项目基础上开发；
☆15Dec 17, 2021Updated 4 years ago
meibaocai / TestMPlatform
View on GitHub
☆13Feb 15, 2023Updated 3 years ago
CreaLabs / Enhanced-BGE-M3-with-CLP-and-MoE
View on GitHub
This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…
☆11Dec 27, 2024Updated last year
lework / llm-benchmark
View on GitHub
LLM 并发性能测试工具，支持自动化压力测试和性能报告生成。
☆270Dec 10, 2025Updated 7 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
open-compass / opencompass
View on GitHub
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …
☆7,246Updated this week
softctwo / patent-agents
View on GitHub
AI-Powered Patent Writing and Drawing Agents - 支持发明专利、实用新型、外观设计专利撰写及附图绘制，基于Google Gemini 2.0 Flash开发，完全符合专利审查指南要求
☆24Nov 13, 2025Updated 8 months ago
muyinchen / muyinchen.github.io
View on GitHub
☆13Jun 30, 2019Updated 7 years ago
trist725 / mailverifi
View on GitHub
邮箱账密批量验证工具。分析SMTP协议，模拟发送并分析SMTP指令，批量验证已知的邮箱用户名和密码是否匹配可用，可自定义输入输出格式、服务器地址、端口，支持SSL/TLS加密。
☆12Dec 15, 2016Updated 9 years ago
joyheros / realworld
View on GitHub
This repository to demonstrate an application built with Java 21 + SrpingBoot 3 + MyBatis including CRUD operations, authentication, rout…
☆12Dec 1, 2024Updated last year
modelscope / twinkle
View on GitHub
Twinkle✨: Training workbench to make your model glow.
☆249Updated this week
yeze / kdns
View on GitHub
A high-performance DNS Server based on DPDK
☆23Jun 12, 2020Updated 6 years ago