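A large language model evaluation platform that supports multiple evaluation benchmarks, custom datasets, and performance testing. It also supports RAG evaluation based on custom datasets.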
☆89 · Aug 20, 2025 · Updated 9 months ago
Alternatives and similar repositories for llm-eval
Users that are interested in llm-eval are comparing it to the libraries listed below.
- A JMeter-based performance testing platform ☆41 · Feb 2, 2025 · Updated last year
- A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking. ☆2,835 · Updated this week
- Front-end project for fufan-chat-api ☆28 · Nov 1, 2024 · Updated last year
- Cybersecurity classified protection (等保) assessment documents ☆13 · Dec 18, 2018 · Updated 7 years ago
- Spring Boot integration with the Activiti Modeler editor ☆13 · Oct 12, 2017 · Updated 8 years ago
- A code security platform based on Fortify SCA for Windows ☆15 · Mar 6, 2019 · Updated 7 years ago
- A practical utility library for LangChain and LangGraph development ☆105 · Mar 4, 2026 · Updated 2 months ago
- 灵猫智能管理平台 (Lingmao Intelligent Management Platform) is an online web platform for managing test projects and test tools. It covers project management, test case management, module management, UI automation test management, small utility tools, and other system testing tasks. ☆11 · Jun 21, 2021 · Updated 4 years ago
- A helm chart for deploying Neoload Web on your Kubernetes cluster ☆13 · Updated this week
- Support for Elasticsearch 6.3.0 ☆10 · Jul 6, 2018 · Updated 7 years ago
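- An enterprise LLM API gateway for internal API management, distribution, and aggregation. It converts multiple large models into a unified OpenAI-compatible interface, with special support for the open-source Chinese models deepseek, qwen, kimi, and glm. It can be used by individuals or within an enterprise for unified LLM API management and channel distribution (key management and redistribution). Long-term updates, support… ☆40 · Sep 12, 2025 · Updated 8 months ago
- An intelligent medical literature analysis tool built on the Model Context Protocol (MCP). It aims to help researchers, medical practitioners, and students quickly search the PubMed database and use large language models (LLMs) to analyze and summarize literature abstracts. ☆10 · May 18, 2025 · Updated last year
- Web-based real-time visualization of iPerf results ☆17 · Apr 26, 2019 · Updated 7 years ago
- This project is a deliberately vulnerable environment to learn about LLM-specific risks based on the OWASP Top 10 for LLM Applications. ☆51 · Jan 19, 2026 · Updated 4 months ago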
- ☆17 · May 31, 2023 · Updated 2 years ago
- Open-source WeChat threat intelligence bot ☆12 · Mar 13, 2023 · Updated 3 years ago
- Python/numpy/pandas convenience wrapper for the TIMIT database. ☆11 · Nov 26, 2018 · Updated 7 years ago
- This repository demonstrates an application built with Java 21 + Spring Boot 3 + MyBatis, including CRUD operations, authentication, rout… ☆12 · Dec 1, 2024 · Updated last year
- AudioVisual Diarization - Supervised and Unsupervised ☆15 · Nov 22, 2022 · Updated 3 years ago
- ☆10 · May 25, 2015 · Updated 11 years ago
- Code for the paper "Abstractive Summarization Guided by Latent Hierarchical Document Structure" ☆13 · May 20, 2023 · Updated 3 years ago
- ☆17 · Mar 7, 2024 · Updated 2 years ago
- Performance testing for LLM inference services ☆44 · Dec 17, 2023 · Updated 2 years ago
- ☆11 · Aug 21, 2023 · Updated 2 years ago
- A very simple chat application using Spring Boot, Vue.js (in TypeScript), gRPC, gRPC-Web and EnvoyProxy. ☆10 · May 20, 2019 · Updated 7 years ago
- A sentence-pair matching method based on Doc2vec and Word2vec ☆23 · Jun 3, 2017 · Updated 8 years ago
- A sample that converts the MobileSAM encoder/decoder to ONNX and runs inference ☆12 · Apr 11, 2024 · Updated 2 years ago
- Speaker Diarization using GRU in PyTorch ☆11 · Aug 29, 2020 · Updated 5 years ago
- Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales" ☆16 · Aug 11, 2023 · Updated 2 years ago
- A Tight-fisted Optimizer (Tiger), implemented in PyTorch. ☆12 · Jun 26, 2024 · Updated last year
- Codes: a simple, easy-to-use, one-stop R&D management platform. Free to use and installable locally, with R&D management, test management, digital dashboards, CI/CD, API testing, defect management, and DevTestOps. ☆29 · Jun 19, 2023 · Updated 2 years ago
- A chatbot implemented using an RNN and GloVe embeddings which answers your query crazily ☆12 · Jan 1, 2020 · Updated 6 years ago
- A modular and stable agent sandbox runtime environment. ☆52 · Updated this week
- ☆22 · Dec 7, 2021 · Updated 4 years ago
- A BERT-fusing architecture for Twitter sentiment analysis. Accepted at the AACL-IJCNLP 2020 Student Research Workshop. ☆11 · Jun 12, 2023 · Updated 2 years ago
- ☆10 · Jul 18, 2024 · Updated last year
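- AIGC evals ☆10 · Dec 2, 2023 · Updated 2 years ago
- LTX-Video-Trainer-GUI is a GUI tool for training LTX video LoRA models. It supports training LoRA models for video generation through a simple interface, so users can configure and launch the training process without writing complex code. ☆13 · Jul 18, 2025 · Updated 10 months ago
- Chinese localization of H1ve-theme and CTFd-owl ☆18 · Nov 10, 2022 · Updated 3 years ago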