大语言模型评估平台,支持多种评估基准、自定义数据集和性能测试。支持基于自定义数据集的RAG评估。
☆79Aug 20, 2025Updated 6 months ago
Alternatives and similar repositories for llm-eval
Users that are interested in llm-eval are comparing it to the libraries listed below
Sorting:
- 大模型API企业网关,公司内部API管理,分发聚和系统,支持将多种大模型转换成统一的OpenAI兼容接口,尤其对国内开源模型deepseek,qwen,kimi,glm提供特别支持 可供个人或者企业内部大模型API统一管理和渠道分发使用(key管理与二次分发),长期更新,支…☆36Sep 12, 2025Updated 5 months ago
- fufan-chat-api的前端项目☆27Nov 1, 2024Updated last year
- This repository to demonstrate an application built with Java 21 + SrpingBoot 3 + MyBatis including CRUD operations, authentication, rout…☆12Dec 1, 2024Updated last year
- 基于jmeter的性能测试平台☆39Feb 2, 2025Updated last year
- A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.☆2,436Updated this week
- 一个基于FastAPI和React的智能体系统,支持多智能体管理、mcp管理、知识库、聊天对话等功能。An intelligent agent system based on FastAPI and React, supporting multi-agent managem…☆21Jan 25, 2026Updated last month
- Y-Agent Studio 是一个面向 企业级应用 的Agent开发套,Y-Agent是其中的核心模块。 包含了:支持智能体编排、RAG、流程日志、单元测试、流程测试、语料生产等垂直领域非常需要的功能。 智能体编排可以在同一个流程中,同时支持多智能体协作和流程混合编排…☆25Oct 4, 2025Updated 4 months ago
- rag base on langchain☆11Mar 1, 2024Updated 2 years ago
- A practical utility library for LangChain and LangGraph development☆100Feb 21, 2026Updated last week
- 基于检索增强生成(RAG)技术的ICD-10医疗诊断内容标准化工具,支持中文医学术语的智能匹配和标准化。☆17Aug 12, 2025Updated 6 months ago
- dify 知识库检索工具☆13Apr 3, 2025Updated 10 months ago
- In this programming assignment you will implement a streaming video server and client that communicate control commands via the Real-Time…☆11Dec 29, 2012Updated 13 years ago
- ☆11Jan 13, 2023Updated 3 years ago
- LLM 推理服务性能测试☆44Dec 17, 2023Updated 2 years ago
- This is a depth-anything-v2 onnxruntime inference by cpp☆15Sep 2, 2024Updated last year
- LTX-Video-Trainer-GUI 是为LTX视频lora模型训练提供的GUI工具,支持通过简单的界面训练 LoRA 模型用于视频生成。本训练器提供了直观的 GUI 界面,使用户能够轻松设置和启动训练流程,无需编写复杂代码。☆13Jul 18, 2025Updated 7 months ago
- Unofficial docker wrapper for Qualcomm SNPE(Snapdragon Neural Processing Engine) SDK☆11Mar 3, 2022Updated 3 years ago
- ☆10Feb 4, 2016Updated 10 years ago
- tensorrt部署教程☆11Aug 1, 2025Updated 7 months ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing☆11Apr 1, 2020Updated 5 years ago
- ☆12Updated this week
- DocChecker: Bootstrapping Code-Text Pretrained Language Model to Detect Inconsistency Between Code and Comment☆15Jan 23, 2024Updated 2 years ago
- 动手写全文搜索引擎☆10Aug 12, 2020Updated 5 years ago
- yolov8在hisi3536a推理☆11Dec 15, 2023Updated 2 years ago
- Deliver LLMs of GGUF format via Dockerfile.☆15Oct 24, 2024Updated last year
- ☆13Mar 16, 2025Updated 11 months ago
- MobileSAM のエンコーダー/デコーダーをONNXに変換し、推論するサンプル☆12Apr 11, 2024Updated last year
- Docker&vLLM官方镜像部署DeepSeek模型,在生产环境中提供类OpenAI接口服务。☆15Jul 17, 2025Updated 7 months ago
- 标注自己的数据集,训练、评估、测试、部署自己的人工智能算法☆11May 28, 2024Updated last year
- 简单rtsp服务器,支持264 aac☆12Dec 17, 2021Updated 4 years ago
- 一个强大的、由 AI 驱动的演示文稿(PPt)自动化生成工具,真正生产化的工具,全流程可控,帮助用户快速制作出符合需求的 PPt。☆26Sep 23, 2025Updated 5 months ago
- 灵芝IAST是一款交互式应用安全评估工具,覆盖了Java WEB相关安全风险的检测,具有近实时检测、准确率高、误报率低、漏洞链路清晰等特点|使用之前请阅读官方文档☆16Jul 18, 2020Updated 5 years ago
- 本项目旨在利用LangChain和大语言模型(如ZhipuAI)开发一个智能数据库问答系统。 该系统能够通过自然语言理解用户的查询请求,自动生成相应的SQL语句并执行,最后将查询结果以自然语言 形式返回用户。☆17Jul 31, 2024Updated last year
- InternLM-7B微调, SFT/LoRA, instruction finetune☆13May 17, 2024Updated last year
- A very simple chat application using Spring Boot, Vue.js (in TypeScript), gRPC, gRPC-Web and EnvoyProxy.☆10May 20, 2019Updated 6 years ago
- Rust based Diablo 2 hack.☆13Jan 27, 2024Updated 2 years ago
- ☆23Nov 14, 2025Updated 3 months ago
- 简单快速的部署深度学习模型☆13Sep 3, 2023Updated 2 years ago
- AI assistant application based on Next.js, supporting Dialog Management and external Skill (script/tool/rule) integration. AI autonomousl…☆37Feb 9, 2026Updated 3 weeks ago