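An evaluation platform for large language models that supports multiple evaluation benchmarks, custom datasets, and performance testing. It also supports RAG evaluation on custom datasets.
☆87 · Aug 20, 2025 · Updated 7 months ago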
Alternatives and similar repositories for llm-eval
Users that are interested in llm-eval are comparing it to the libraries listed below.
- A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking. ☆2,649 · Updated this week
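- A performance testing platform based on JMeter. ☆41 · Feb 2, 2025 · Updated last year
- Front-end project for fufan-chat-api. ☆27 · Nov 1, 2024 · Updated last year
- Lingzhi IAST (灵芝IAST) is an interactive application security assessment tool that covers detection of security risks related to Java web applications, featuring near-real-time detection, high accuracy, a low false-positive rate, and clear vulnerability traces | Please read the official documentation before use. ☆16 · Jul 18, 2020 · Updated 5 years ago
- Documentation for China's Multi-Level Protection Scheme (等保) compliance assessment. ☆12 · Dec 18, 2018 · Updated 7 years ago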
- A code security platform based on fortify sca windows. ☆15 · Mar 6, 2019 · Updated 7 years ago
- Test Management Platform for Automation. ☆17 · Aug 24, 2015 · Updated 10 years ago
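- MyBatis-Pageable is an automatic pagination plugin built on MyBatis's internal plugin Interceptor. It intercepts the two overloaded Executor.query methods to compute pagination information, and automatically runs the appropriate query for the configured database Dialect to count the total number of records. ☆14 · Mar 18, 2019 · Updated 7 years ago
- A helm chart for deploying Neoload Web on your Kubernetes cluster. ☆13 · Updated this week
- A website offering a medical data anonymization tool. ☆13 · Jun 28, 2022 · Updated 3 years ago
- This project is a deliberately vulnerable environment to learn about LLM-specific risks based on the OWASP Top 10 for LLM Applications. ☆52 · Jan 19, 2026 · Updated 2 months ago
- This paper proposes a safety assessment benchmark for Chinese LLMs based on "文心一言", covering 8 typical safety scenarios and 6 types of instruction attacks. It also proposes a safety assessment framework and process that uses test prompts both written by hand and collected from open-source data, combining human intervention with the LLM's strong evaluation ability as a "co-evaluator". ☆33 · Sep 1, 2023 · Updated 2 years ago
- ☆23 · Oct 28, 2025 · Updated 5 months ago
- A set of personally packaged, out-of-the-box Spring Boot Starter components, simple and practical; they will continue to be extended as needs arise. ☆16 · Mar 27, 2026 · Updated 2 weeks ago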
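- An online load-testing platform built on JMeter that adds some customized features on top of the original version; this system is developed on the basis of the open-source project zyanycall/stressTestPlatform. ☆15 · Dec 17, 2021 · Updated 4 years ago
- Smart city, built with three.js, Vue3, and vite. ☆10 · Feb 4, 2024 · Updated 2 years ago
- This repository demonstrates an application built with Java 21 + Spring Boot 3 + MyBatis including CRUD operations, authentication, rout… ☆12 · Dec 1, 2024 · Updated last year
- 🔥Sakura Automation Platform🔥 is a one-stop continuous automation platform covering app automation, web automation, API automation, and performance automation. It supports distributed testing and is fully compatible with mainstream open-source frameworks such as Appium, Selenium, Rest Assured, and JMeter, effectively helping… ☆27 · Mar 11, 2025 · Updated last year
- ☆10 · May 25, 2015 · Updated 10 years ago
- Spring integration with the Elasticsearch 5.5.1 TransportClient. ☆19 · Sep 8, 2017 · Updated 8 years ago
- A sentence-pair matching method based on Doc2vec and Word2vec. ☆23 · Jun 3, 2017 · Updated 8 years ago
- An integrated scenic-area management platform that combines ECharts with large-screen dashboards; the dashboard width (percentage) and height (rem) are adaptive. ☆11 · Apr 27, 2018 · Updated 7 years ago
- A sample that converts the MobileSAM encoder/decoder to ONNX and runs inference. ☆12 · Apr 11, 2024 · Updated 2 years ago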
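- Codes, an easy-to-use one-stop R&D management platform: free to use, local installation, R&D management, test management, digital dashboards, CI/CD, API testing, defect management, DevTestOps. ☆29 · Jun 19, 2023 · Updated 2 years ago
- [Load-testing engine] A simple, easy-to-use performance testing platform with separate front end and back end; supports distributed JMeter load testing, logs, reports, and more. ☆23 · Apr 15, 2025 · Updated 11 months ago
- Claude Agent SDK UI. ☆47 · Jan 13, 2026 · Updated 3 months ago
- ☆27 · Apr 14, 2025 · Updated 11 months ago
- Easy Watermark is a simple and easy-to-use watermarking framework that adds watermarks to different types of files using the same method. ☆15 · Oct 12, 2024 · Updated last year
- The OpenHIS hospital system (Xinchuang/信创 edition) integrates ten core modules, covering catalog management, basic data configuration, personalized settings, full-process outpatient/inpatient management, intelligent pharmacy and drug warehouse control, fine-grained consumables management, a financial accounting system, medical-insurance compliance integration, and multi-dimensional report analysis, for a total of 372 standardized functions. ☆18 · Feb 5, 2026 · Updated 2 months ago
- Experimental H5P content for automated feedback on texts. ☆17 · Apr 1, 2026 · Updated last week
- ☆22 · Dec 7, 2021 · Updated 4 years ago
- An AI writing tool approach: two agents cooperate to produce genuinely usable, illustrated posts (WeChat official accounts, Xiaohongshu, blogs). 1. A writing agent; 2. a knowledge-base agent. ☆21 · Jun 8, 2025 · Updated 10 months ago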
- ☆10 · Jul 18, 2024 · Updated last year
- aigc evals. ☆10 · Dec 2, 2023 · Updated 2 years ago
- Converts Swagger files to contracts for Spring Cloud Contract. ☆26 · Jul 31, 2020 · Updated 5 years ago
- A custom alert dialog. ☆11 · Jun 29, 2016 · Updated 9 years ago
- In this programming assignment you will implement a streaming video server and client that communicate control commands via the Real-Time… ☆11 · Dec 29, 2012 · Updated 13 years ago
- [npj Digital Medicine] An In-Depth Evaluation of Federated Learning on Biomedical Natural Language Processing for Information Extraction. ☆12 · May 1, 2024 · Updated last year
- A tool platform for comparing the performance of different large language models (LLMs), supporting the DeepSeek API, local Ollama models, and local VLLM models. A simple tool to test multiple models and display the time cost. ☆28 · May 7, 2025 · Updated 11 months ago