justplus / llm-eval
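A large language model evaluation platform that supports multiple evaluation benchmarks, custom datasets, and performance testing. It also supports RAG evaluation based on custom datasets.
☆51 Updated last month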
Alternatives and similar repositories for llm-eval
Users who are interested in llm-eval are comparing it to the libraries listed below.
- OpenSearch-SQL code ☆140 Updated 3 months ago
- Dingo: A Comprehensive AI Data Quality Evaluation Tool ☆466 Updated last week
- Agentic RAG R1 Framework via Reinforcement Learning ☆297 Updated this week
- Repository for the Zhipu AI 2024 Financial Industry LLM Challenge. ☆54 Updated 7 months ago
- XiYanSQL models for Text-to-SQL. ☆121 Updated 2 weeks ago
- Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents! Build intelligent multimodal AI agents. ☆202 Updated last week
- A method and corresponding code for automatic description generation for Text-to-SQL ☆91 Updated 2 weeks ago
- [Preprint] DeepSieve: Information Sieving via LLM-as-a-Knowledge-Router ☆92 Updated last month
- Chat2Graph: Graph Native Agentic System. ☆349 Updated this week
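- A second-round competition entry for the "2024 Financial Industry LLM Challenge", hosted by the Foundation Model Research Center of Tsinghua University. ☆46 Updated 4 months ago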
- A unified tool to generate fine-tuning datasets for LLMs, including questions, answers, and dialogues. ✨🤖📚💬 ☆61 Updated 6 months ago
- dify's rag patch module ☆273 Updated 2 weeks ago
- [VLDB '25] Synthesizing High-quality Text-to-SQL Data at Scale. SynSQL-2.5M is the first million-scale cross-domain text-to-SQL dataset. ☆337 Updated last week
- GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation ☆349 Updated this week
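- Best practices for retrieval-augmented generation (RAG) with large models. ☆83 Updated last year
- To try textin document parsing, visit https://cc.co/16YSIy ☆120 Updated 2 months ago
- Exploring the potential of LLM applications in the legal industry. ☆91 Updated 9 months ago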
- ☆147 Updated 6 months ago
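- A comprehensive knowledge base for LLM evaluation | prompt engineering, LLM leaderboards from various channels, benchmark datasets, safety testing, adversarial attacks, agents, high-quality data, text classification, relation extraction, speech recognition, speech synthesis, multimodal, text-to-image, text-to-video, point clouds, intelligent dialogue, summarization, question answering… ☆70 Updated 10 months ago
- An application example of GraphRAG. The project provides a way to replace the OpenAI models and, by modifying the original prompts and the document-splitting method, improves GraphRAG's ability to handle Chinese content. ☆167 Updated 10 months ago
- A personal trove of notes on large models. ☆50 Updated 5 months ago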
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents ☆540 Updated 3 months ago
- A collection of RAG systems powered by LLM. ☆202 Updated 6 months ago
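- Generate SFT training data by combining free LLM APIs with your private-domain data (entirely free of charge). Supports the training data formats of tools such as llamafactory. Synthetic data. ☆185 Updated 9 months ago
- A collection of AI Agent resources, including but not limited to basic concepts, evaluation benchmarks, Agent SDKs, core papers, and open-source projects. ☆113 Updated 2 months ago
- unify-easy-llm (ULM) aims to be a simple, one-click LLM training tool that supports different hardware, such as Nvidia GPU and Ascend NPU, as well as commonly used large models. ☆57 Updated last year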
- ☆261 Updated 9 months ago
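- Chinese LLM fine-tuning (LLM-SFT), the math instruction dataset MWP-Instruct, supported models (ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B), supports (LoRA, QLoRA, DeepSpeed, UI, TensorboardX), supports (微… ☆211 Updated last year
- gpt_server is an open-source framework for production-grade deployment of LLMs, Embedding, Reranker, ASR, TTS, text-to-image, image editing, and text-to-video models. ☆209 Updated this week
- An MCP server for querying the technical documentation of mainstream agent frameworks (supports both the stdio and sse transport protocols). Supports langchain, llama-index, autogen, agno, openai-agents-sdk, mcp-doc, camel-ai, and crew-ai. ☆136 Updated 4 months ago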