这是一个基于OpenCompass的模型评测系统,该系统提供了前端页面UI以方便用户自助开展评测工作。
☆27Aug 25, 2025Updated 7 months ago
Alternatives and similar repositories for ai-eval-system
Users that are interested in ai-eval-system are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆32Aug 5, 2025Updated 8 months ago
- ☆25Mar 10, 2021Updated 5 years ago
- 致力于分享一些优秀的开源程序和客户端软件。比如商城、小程序、H5、网站、办公系统、OA、CRM、ERP、内容管理系统CMS、物联网系统、智能硬件、人工智能AI、大数据分析、智慧大屏、工具类软件、编程类软件工具、服务器运维、网络安全、前端技术、后台技术等等☆12Jun 15, 2024Updated last year
- Implementation of KDR-Agent, the AAAI 2025 accepted paper, focusing on knowledge-driven reasoning for autonomous agents.☆18Nov 24, 2025Updated 4 months ago
- SLM-SQL: An Exploration of Small Language Models for Text-to-SQL☆32Aug 12, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Environments, tools, and benchmarks for general computer agents☆14Dec 3, 2024Updated last year
- Tools for working with the S800 corpus☆12Sep 17, 2020Updated 5 years ago
- 基于jmeter的性能测试平台☆41Feb 2, 2025Updated last year
- ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World☆25Jun 17, 2025Updated 9 months ago
- 东方隐侠安全团队 AI Agent Skills大集合☆41Mar 22, 2026Updated 3 weeks ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆13Jul 27, 2025Updated 8 months ago
- 微信开源威胁情报机器人☆13Mar 13, 2023Updated 3 years ago
- 2023年iThome鐵人賽「AI & Data」組佳作【30天內成為NLP大師:掌握關鍵工具和技巧】完整程式碼,該文章會從零開始教你該如何微調大型語言模型☆18Nov 21, 2024Updated last year
- ☆14Jan 8, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆25Apr 10, 2025Updated last year
- Implementation of 12 AI agents evaluation techniques☆39Jul 31, 2025Updated 8 months ago
- 景区综合管理平台 ----echats 和 大屏 的完美结合 ,大屏宽度(百分比)高度(rem)自适应☆11Apr 27, 2018Updated 7 years ago
- 查找漏网之鱼:根据对照名单查找缺漏的名字☆10Dec 22, 2025Updated 3 months ago
- Lark api reverse engineering / 飞书 API 逆向工程☆12Jun 26, 2023Updated 2 years ago
- Universal AI-powered code reviewer using vLLM and/or Ollama provided local LLMs. Works with any language/project. Features persona system…☆38Mar 11, 2026Updated last month
- 智能agent开发的baseline☆27Jul 26, 2025Updated 8 months ago
- (ACL 2025) Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation☆12May 21, 2025Updated 10 months ago
- ☆19Feb 24, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICML 2024] KnowFormer: Revisiting Transformers for Knowledge Graph Reasoning☆25Sep 20, 2024Updated last year
- A生成测试用例:基于页面和需求文档内容结合自动生成测试用例,解决单个需求文档生成测试用例质量较差的问题。Test Case Generation: Automatically generate test cases based on the current page and…☆54Apr 10, 2025Updated last year
- Instruction Following Eval☆16Jan 16, 2025Updated last year
- 基于 Jmeter 的轻量级云压测平台☆11Oct 18, 2018Updated 7 years ago
- Paper Fetcher Project 是一个开源的 Python 项目,旨在自动化从多种学术资源(例如 ArXiv、Google Scholar 和 PubMed)抓取学术论文的过程。该工具可以定时抓取并去重保存已获取的论文数据,帮助研究人员保持文献的更新和管理。☆24Nov 20, 2024Updated last year
- 这是一个拥有四端的微信机器人应用程序,浏览器客户端(React 全家桶 + Ant Design UI)、监听服务端(TypeScript + Typeorm + RabbitMQ + **Wechaty** + Koa2)、存储服务端(TypeScript + Typeo…☆12Dec 11, 2022Updated 3 years ago
- H1ve-theme和CTFd-owl汉化☆18Nov 10, 2022Updated 3 years ago
- Benchmarking LLM Inference Speeds☆13Apr 7, 2026Updated last week
- ☆11Nov 12, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- nuxt3 three.js 3d地图 大屏可视化模板☆23Jul 19, 2024Updated last year
- I don't want to maintain this project, the code probably won't compile or run. Archived.☆13Feb 25, 2024Updated 2 years ago
- 用 Go 编写的博客爬虫,定期抓取并更新 xargin.com 上的文章信息。程序将文章信息(包括标题、发表时间、阅读时间和 URL)存储在一个 Markdown 文件中,并 使用 GitHub Actions 每小时自动更新。☆11Nov 27, 2024Updated last year
- 用netty实现的简单websocket服务器,根据RFC6455规范实现编/解码器☆20Oct 12, 2017Updated 8 years ago
- ☆20Mar 4, 2025Updated last year
- ☆50Mar 5, 2025Updated last year
- 中文原生多层次文生视频测评基准☆18Jul 8, 2024Updated last year