justplus / llm-evalLinks
大语言模型评估平台,支持多种评估基准、自定义数据集和性能测试。支持基于自定义数据集的RAG评估。
☆76Updated 5 months ago
Alternatives and similar repositories for llm-eval
Users that are interested in llm-eval are comparing it to the libraries listed below
Sorting:
- Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool☆637Updated this week
- A unified tool to generate fine-tuning datasets for LLMs, including questions, answers, and dialogues. ✨🤖📚💬☆62Updated 10 months ago
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆626Updated 3 weeks ago
- OpenSearch-SQL code☆166Updated 8 months ago
- Agentic RAG R1 Framework via Reinforcement Learning☆380Updated last week
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。☆244Updated last week
- “AI-Compass”将为社区指引在 AI 技术海洋中航行的方向,无论你是初学者还是进阶开发者,都能在这里找到通往 AI 各大方向的路径。旨在帮助开发者系统性地了解 AI 的核心概念、主流技术、前沿趋势,并通过实践掌握从理论到落地的全过程。☆547Updated last month
- 利用免费的大模型api来结合你的私域数据来生成sft训练数据(妥妥白嫖)支持llamafactory等工具的训练数据格式synthetic data☆194Updated last year
- 这个是由清华大学基础模型研究中心主办的《2024金融行业·大模型挑战赛》复赛参赛方案☆57Updated 9 months ago
- XiYanSQL models for Text-to-SQL.☆145Updated 5 months ago
- Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents! 构建智能的多模态AI Agent。☆244Updated this week
- A method and corresponding code for automatic description generation for Text-to-SQL☆107Updated 5 months ago
- unify-easy-llm(ULM)旨在打造一个简易的一键式大模型训练工具,支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。☆60Updated last year
- ☆168Updated 11 months ago
- LLM 并发性能测试工具,支持自动化压力测试和性能报告生成。☆213Updated last month
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆233Updated 2 weeks ago
- [EACL'26] DeepSieve: Information Sieving via LLM-as-a-Knowledge-Router☆105Updated last month
- prompt 工程项目案例☆114Updated this week
- ☆130Updated 3 months ago
- a date understanding and reasoning enhanced model☆51Updated 5 months ago
- ☆363Updated last year
- [VLDB' 25] Synthesizing High-quality Text-to-SQL Data at Scale. SynSQL-2.5M is the first million-scale cross-domain text-to-SQL dataset.☆421Updated 5 months ago
- Chat2Graph: Graph Native Agentic System.☆395Updated 3 months ago
- LLaMA Factory Document☆164Updated last week
- A collection of RAG systems powered by LLM.☆216Updated 10 months ago
- 大模型检索增强生成技术最佳实践。☆88Updated last year
- 在RAG技术中,嵌入向量的生成和匹配是关键环节。本文介绍了一种基于CLIP/BLIP模型的嵌入服务,该服务支持文本和图像的嵌入生成与相似度计算,为多模态信息检索提供了基础能力。☆42Updated last year
- dify's rag patch module☆277Updated 5 months ago
- 支持查询主流agent框架技术文档的MCP server(支持stdio和sse两种传输协议), 支持 langchain、llama-index、autogen、agno、openai-agents-sdk、mcp-doc、camel-ai 和 crew-ai☆154Updated 9 months ago
- An easy-to-use framework for modular RAG☆432Updated this week