这是一个基于OpenCompass的模型评测系统,该系统提供了前端页面UI以方便用户自助开展评测工作。
☆27Aug 25, 2025Updated 7 months ago
Alternatives and similar repositories for ai-eval-system
Users that are interested in ai-eval-system are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆32Aug 5, 2025Updated 7 months ago
- ☆25Mar 10, 2021Updated 5 years ago
- 扣子智能体 API Java SDK 是对扣子智能体的API进行了封装,方便Java开发者接入系统调用。☆30Sep 25, 2024Updated last year
- Implementation of KDR-Agent, the AAAI 2025 accepted paper, focusing on knowledge-driven reasoning for autonomous agents.☆18Nov 24, 2025Updated 4 months ago
- Environments, tools, and benchmarks for general computer agents☆14Dec 3, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆17Feb 20, 2023Updated 3 years ago
- 用VBA 在officer word开发工具中建立一个AI生成器,调用AI大模型API(如Deepseek\Qwen/Qwen2.5),返回所需要的结果插入word文档。☆19Feb 7, 2025Updated last year
- [ICDE 2026] Text2SQL-Flow: A Robust SQL-Aware Data Augmentation Framework for Text-to-SQL☆27Mar 16, 2026Updated last week
- 基于jmeter的性能测试平台☆41Feb 2, 2025Updated last year
- ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World☆25Jun 17, 2025Updated 9 months ago
- human in the loop in dify workflow by plugin☆15Jan 7, 2025Updated last year
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆13Jul 27, 2025Updated 7 months ago
- ☆17Apr 11, 2025Updated 11 months ago
- 微信开源威胁情报机器人☆13Mar 13, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 智慧城市,使用three.js、Vue3、vite☆10Feb 4, 2024Updated 2 years ago
- 2023年iThome鐵人賽「AI & Data」組佳作【30天內成為NLP大師:掌握關鍵工具和技巧】完整程式碼,該文章會從零開始教你該如何微調大型語言模型☆18Nov 21, 2024Updated last year
- CozeBot-WxworkPro 是一个集成了AI应用开发平台“扣子”的企微脚本,能够快速构建基于大模型的各种Bot,自动处理企业微信中的消息,提高工作效率。☆15Aug 7, 2024Updated last year
- Implementation of 12 AI agents evaluation techniques☆37Jul 31, 2025Updated 7 months ago
- 查找漏网之鱼:根据对照名单查找 缺漏的名字☆10Dec 22, 2025Updated 3 months ago
- Lark api reverse engineering / 飞书 API 逆向工程☆12Jun 26, 2023Updated 2 years ago
- 智能agent开发的baseline☆27Jul 26, 2025Updated 8 months ago
- The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed …☆11Sep 27, 2024Updated last year
- ☆19Feb 24, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Easy Watermark is a simple and easy-to-use watermarking framework that adds watermarks to different types of files using the same method.☆15Oct 12, 2024Updated last year
- [ICML 2024] KnowFormer: Revisiting Transformers for Knowledge Graph Reasoning☆25Sep 20, 2024Updated last year
- A生成测试用例:基于页面和需求文档内容结合自动生成测试用例,解决单个需求文档生成测试用例质量较差的问题。Test Case Generation: Automatically generate test cases based on the current page and…☆53Apr 10, 2025Updated 11 months ago
- 基于 Jmeter 的轻量级云压测平台☆11Oct 18, 2018Updated 7 years ago
- An easy tool to extract slides from presentations ( lectures 😉 )☆14Dec 10, 2023Updated 2 years ago
- H1ve-theme和CTFd-owl汉化☆18Nov 10, 2022Updated 3 years ago
- Benchmarking LLM Inference Speeds☆13Mar 3, 2026Updated 3 weeks ago
- ☆13Sep 18, 2024Updated last year
- I don't want to maintain this project, the code probably won't compile or run. Archived.☆13Feb 25, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 前端性能监控工具☆11Aug 19, 2020Updated 5 years ago
- 用 Go 编写的博客爬虫,定期抓取并更新 xargin.com 上的文章信息。程序将文章信息(包括标题、发表时间、阅读时间和 URL)存储在一个 Markdown 文件中,并使用 GitHub Actions 每小时自动更新。☆11Nov 27, 2024Updated last year
- Python - 100天从新手到大师☆10Oct 14, 2021Updated 4 years ago
- ☆20Mar 4, 2025Updated last year
- 🎤 开源语音输入工具 | 比 Typeless 更早的免费方案!支持豆包流式ASR、OpenAI GPT-4o Transcribe、本地Whisper。按下快捷键说话,文字自动输入到光标处。☆37Mar 16, 2026Updated last week
- 一个简单的WPS插 件,将代码复制到对应路径即可将DeepSeek等主流工具插入到WPS中☆36Feb 12, 2025Updated last year
- ☆50Mar 5, 2025Updated last year