这是一个基于OpenCompass的模型评测系统,该系统提供了前端页面UI以方便用户自助开展评测工作。
☆27Aug 25, 2025Updated 9 months ago
Alternatives and similar repositories for ai-eval-system
Users that are interested in ai-eval-system are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于轻量级 Qwen2.5-0.5B 和 SigLIP 的视觉语言多模态模型实现,包含训练和 SFT 代码。分享训练和 SFT 相关代码,记录一下探索和学习的过程。欢迎一起交流讨论~☆20Aug 31, 2025Updated 8 months ago
- 🌍 AI Travel Agent - Intelligent multi-agent system powered by LangGraph that coordinates specialized AI agents (flight search, hotel boo…☆19Nov 13, 2025Updated 6 months ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆33Aug 5, 2025Updated 9 months ago
- ☆25Mar 10, 2021Updated 5 years ago
- 扣子智能体 API Java SDK 是对扣子智能体的API进行了封装,方便Java开发者接入系统调用。☆31Sep 25, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 在使用AI工具进行深入分析面试岗位需求与应聘者简历的基础上,依据既定标准向应聘 者提出针对性问题。在应聘者完成问题解答后,将对简历内容及面试表现进行全面评估,并提供具体的改进建 议,以助其提升职业竞争力。☆17Mar 19, 2024Updated 2 years ago
- 致力于分享一些优秀的开源程序和客户端软件。比如商城、小程序、H5、网站、办公系统、OA、CRM、ERP、内容管理系统CMS、物联网系统、智能硬件、人工智能AI、大数据分析、智慧大屏、工具类软件、编程类软件工具、服务器运维、网络安全、前端技术、后台技术等等☆12Jun 15, 2024Updated last year
- SLM-SQL: An Exploration of Small Language Models for Text-to-SQL☆34Aug 12, 2025Updated 9 months ago
- Environments, tools, and benchmarks for general computer agents☆15Dec 3, 2024Updated last year
- ☆17Feb 20, 2023Updated 3 years ago
- Tools for working with the S800 corpus☆12Sep 17, 2020Updated 5 years ago
- [ICDE'2026] Text2SQL-Flow: A Robust SQL-Aware Data Augmentation Framework for Text-to-SQL☆33Mar 25, 2026Updated 2 months ago
- 基于jmeter的性能测试平台☆41Feb 2, 2025Updated last year
- This is the solution for lab work of MIT: 6.5840 - Distributed Systems☆15Oct 30, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- human in the loop in dify workflow by plugin☆16Jan 7, 2025Updated last year
- ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World☆25Jun 17, 2025Updated 11 months ago
- Vim plugins, syntax and indention for Running SAS and editing SAS programs.☆14Apr 7, 2015Updated 11 years ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆14Jul 27, 2025Updated 9 months ago
- ☆17Apr 11, 2025Updated last year
- 微信开源威胁情报机器人☆12Mar 13, 2023Updated 3 years ago
- 智慧城市,使用three.js、Vue3、vite☆10Feb 4, 2024Updated 2 years ago
- ShinyApps modules for user-handled account registering, password recovery and log-in.☆15Aug 31, 2022Updated 3 years ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- CozeBot-WxworkPro 是一个集成了AI应用开发平台“扣子”的企微脚本,能够快速构建基于大模型的各种Bot,自动处理企业微信中的消息,提高工作效率。☆16Aug 7, 2024Updated last year
- This is the public repository of AAAI 2024 paper "Is a Large Language Model a Good Annotator for Event Extraction"☆10Feb 16, 2024Updated 2 years ago
- 网络安全 LLM 智能体应用教程☆29Mar 2, 2025Updated last year
- ☆14Jan 8, 2025Updated last year
- Syntax highlighting for R HTML documentation☆20May 12, 2024Updated 2 years ago
- ☆26Apr 10, 2025Updated last year
- Implementation of 12 AI agents evaluation techniques☆43Jul 31, 2025Updated 9 months ago
- 查找漏网之鱼:根据对照名单查找缺漏的名字☆10Dec 22, 2025Updated 5 months ago
- The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed …☆11Sep 27, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Universal AI-powered code reviewer using vLLM and/or Ollama provided local LLMs. Works with any language/project. Features persona system…☆43Updated this week
- Luckyexcel node case☆14Nov 19, 2020Updated 5 years ago
- ☆20Feb 24, 2025Updated last year
- Easy Watermark is a simple and easy-to-use watermarking framework that adds watermarks to different types of files using the same method.☆15Oct 12, 2024Updated last year
- 又一个满足你的调用多种大模型API的轮子,支持目前市面多家第三方大模型,包含ChatGPT、通义千问、文心大模型、混元、盘古、百川智能等; 一套写法兼容所有平台,简单配置即可灵活使用第三方大模型API。☆29Sep 16, 2025Updated 8 months ago
- 基于 Jmeter 的轻量级云压测平台☆11Oct 18, 2018Updated 7 years ago
- Data About Emojis☆28Oct 28, 2024Updated last year