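An evaluation platform for large language models that supports multiple evaluation benchmarks, custom datasets, and performance testing. It also supports RAG evaluation on custom datasets.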
☆92Aug 20, 2025Updated 9 months ago
Alternatives and similar repositories for llm-eval
Users that are interested in llm-eval are comparing it to the libraries listed below.
- A performance testing platform based on jmeter☆41Feb 2, 2025Updated last year
- A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.☆2,934Jun 12, 2026Updated last week
- Front-end project for fufan-chat-api☆27Nov 1, 2024Updated last year
- A performance load-testing platform based on python3 + vue3 + locust + grafana; cool and user-friendly☆14Apr 22, 2024Updated 2 years ago
- A code security platform based on fortify sca windows☆15Mar 6, 2019Updated 7 years ago
- Test Management Platform for Automation☆16Aug 24, 2015Updated 10 years ago
- A practical utility library for LangChain and LangGraph development☆105Mar 4, 2026Updated 3 months ago
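- The 灵猫 intelligent management platform is an online web platform for managing test projects and testing tools. Through its fast, agile flexibility, it provides project management, test case management, module management, UI automation test management, small utility tools, and other system testing functions☆11Jun 21, 2021Updated 4 years ago
- An enterprise gateway for LLM APIs: an internal API management, distribution, and aggregation system that converts multiple large models to a unified OpenAI-compatible interface, with special support for Chinese open-source models such as deepseek, qwen, kimi, and glm. Suitable for individuals or enterprises that need unified LLM API management and channel distribution (key management and redistribution); updated long term, …☆40Sep 12, 2025Updated 9 months ago
- An intelligent medical literature analysis tool built on the Model Context Protocol (MCP). It aims to help researchers, medical practitioners, and students quickly search the PubMed database and use large language models (LLMs) to analyze and summarize literature abstracts☆10May 18, 2025Updated last year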
- Web Based Iperf Result Real-time Visualization☆17Apr 26, 2019Updated 7 years ago
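- This paper proposes a safety evaluation benchmark for Chinese LLMs based on 文心一言 (ERNIE Bot), covering 8 typical safety scenarios and 6 types of instruction attack. It also proposes a safety evaluation framework and process that uses test prompts written by hand and collected from open-source data, combining human intervention with the LLM's strong evaluation ability as a "co-evaluator".☆34Sep 1, 2023Updated 2 years ago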
- ☆17May 31, 2023Updated 3 years ago
- N-Tester1.0 platform. The project uses a decoupled front-end/back-end architecture, combining the Python back-end framework FastAPI with the mainstream front-end framework Vue3 for unified development, and offers a one-stop, out-of-the-box experience. It integrates AI and supports AI test case generation, API automation, app automation, UI automation, smart layout, custom LLM vendor configuration…☆91Jun 8, 2026Updated last week
- An online load-testing platform built on Jmeter, adding some customized features to the original version; this system is developed on top of the open-source project zyanycall/stressTestPlatform☆15Dec 17, 2021Updated 4 years ago
- ☆13Feb 15, 2023Updated 3 years ago
- Smart city, built with three.js, Vue3, and vite☆10Feb 4, 2024Updated 2 years ago
- WeChat open-source threat intelligence bot☆13Mar 13, 2023Updated 3 years ago
- Python/numpy/pandas convenience wrapper for the TIMIT database.☆11Nov 26, 2018Updated 7 years ago
- Parses ngrinder CSV results directly and computes TPS standard deviation, TPS fluctuation rate, min/max RT, and RT percentiles (25/50/75/80/85/90/95/99). Displaying these directly on the ngrinder detail page requires custom development; see:☆19Feb 16, 2016Updated 10 years ago
- This repository demonstrates an application built with Java 21 + Spring Boot 3 + MyBatis, including CRUD operations, authentication, rout…☆12Dec 1, 2024Updated last year
- A large collection of AI Agent Skills from the 东方隐侠 security team☆50Mar 22, 2026Updated 2 months ago
- AudioVisual Diarization - Supervised and Unsupervised☆15Nov 22, 2022Updated 3 years ago
- 🔥Sakura Automation Platform🔥 is a one-stop continuous automation platform covering app automation, web automation, API automation, and performance automation. It supports distributed testing and is fully compatible with mainstream open-source frameworks such as Appium, Selenium, Rest Assured, and JMeter, effectively helping…☆29Mar 11, 2025Updated last year
- Code for the paper "Abstractive Summarization Guided by Latent Hierarchical Document Structure"☆13May 20, 2023Updated 3 years ago
- LLM inference service performance testing☆44Dec 17, 2023Updated 2 years ago
- ☆11Aug 21, 2023Updated 2 years ago
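- Scenic area integrated management platform: a combination of ECharts and large-screen dashboards, with adaptive width (percentage) and height (rem)☆11Apr 27, 2018Updated 8 years ago
- A sentence-pair matching method based on Doc2vec and Word2vec☆23Jun 3, 2017Updated 9 years ago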
- In-Context Reinforcement Learning for Tool Use in Large Language Models☆48Mar 26, 2026Updated 2 months ago
- A sample that converts the MobileSAM encoder/decoder to ONNX and runs inference☆12Apr 11, 2024Updated 2 years ago
- Speaker Diarization using GRU in PyTorch☆11Aug 29, 2020Updated 5 years ago
- Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales"☆16Aug 11, 2023Updated 2 years ago
- Add watermark to PDF and Office files☆16Jul 22, 2017Updated 8 years ago
- A Tight-fisted Optimizer (Tiger), implemented in PyTorch.☆12Jun 26, 2024Updated last year
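- Codes, an easy-to-use one-stop R&D management platform: free to use, local installation, R&D management, test management, digital dashboards, CI/CD, API testing, defect management, DevTestOps☆31Jun 19, 2023Updated 3 years ago
- A chatbot implemented using RNN and GloVe embeddings which answers your query crazily☆12Jan 1, 2020Updated 6 years ago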
- ☆12May 22, 2018Updated 8 years ago
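- [Load-testing engine] A simple, easy-to-use performance testing platform with a decoupled front end and back end; supports JMeter distributed load testing, logs, reports, and more☆23Apr 15, 2025Updated last year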