This is a model evaluation system based on OpenCompass. It provides a front-end UI so that users can run evaluations on their own.
☆27Aug 25, 2025Updated 9 months ago
Alternatives and similar repositories for ai-eval-system
Users that are interested in ai-eval-system are comparing it to the libraries listed below.
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆33Aug 5, 2025Updated 10 months ago
- ☆25Mar 10, 2021Updated 5 years ago
- The Coze (扣子) agent API Java SDK wraps the Coze agent API so that Java developers can integrate it into their systems and call it.☆31Sep 25, 2024Updated last year
- [ICDE'2026] Text2SQL-Flow: A Robust SQL-Aware Data Augmentation Framework for Text-to-SQL☆33Mar 25, 2026Updated 2 months ago
- Smart city, built with three.js, Vue3, and vite☆10Feb 4, 2024Updated 2 years ago
- WeChat open-source threat intelligence bot☆13Mar 13, 2023Updated 3 years ago
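- Complete code for "Become an NLP Master in 30 Days: Mastering Key Tools and Techniques", an honorable mention in the "AI & Data" category of the 2023 iThome Ironman Contest (鐵人賽). The series teaches you from scratch how to fine-tune large language models☆18Nov 21, 2024Updated last year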
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated last year
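- CozeBot-WxworkPro is a WeCom (企业微信) script that integrates the AI application development platform Coze (扣子). It can quickly build various LLM-based bots that automatically handle WeCom messages and improve work efficiency.☆16Aug 7, 2024Updated last year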
- This is the public repository of AAAI 2024 paper "Is a Large Language Model a Good Annotator for Event Extraction"☆10Feb 16, 2024Updated 2 years ago
- Tutorial on LLM agent applications for cybersecurity☆29Mar 2, 2025Updated last year
- Implementation of 12 AI agents evaluation techniques☆43Jul 31, 2025Updated 10 months ago
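- Integrated management platform for scenic areas: a combination of echats and large-screen dashboards, with responsive dashboard width (percentage) and height (rem)☆11Apr 27, 2018Updated 8 years ago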
- Cross-lingual Event Detection with Prompt Tuning and Prototypical Learning☆12May 2, 2024Updated 2 years ago
- Find the ones that slipped through: look up missing names against a reference list☆10Dec 22, 2025Updated 5 months ago
- ☆12Sep 5, 2022Updated 3 years ago
- A baseline for intelligent agent development☆27Jul 26, 2025Updated 10 months ago
- The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed …☆11Sep 27, 2024Updated last year
- Universal AI-powered code reviewer using vLLM and/or Ollama provided local LLMs. Works with any language/project. Features persona system…☆43May 30, 2026Updated 2 weeks ago
- (ACL 2025) Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation☆12May 21, 2025Updated last year
- Easy Watermark is a simple and easy-to-use watermarking framework that adds watermarks to different types of files using the same method.☆15Oct 12, 2024Updated last year
- ☆20Feb 24, 2025Updated last year
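- Test case generation: automatically generates test cases from the combined content of the page and the requirements document, addressing the poor test-case quality obtained from a single requirements document alone. Test Case Generation: Automatically generate test cases based on the current page and…☆60May 12, 2026Updated last month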
- Instruction Following Eval☆17Jan 16, 2025Updated last year
- An easy tool to extract slides from presentations ( lectures 😉 )☆14Dec 10, 2023Updated 2 years ago
- [ACL 2025 Main] Open-source toolkit for automatic evaluation of text-to-image generation task, including training & test datasets and a d…☆20Jul 5, 2025Updated 11 months ago
- ☆11Nov 12, 2024Updated last year
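- 2021 Tencent Advertising Algorithm Competition, Track 2, team 神奈川冲浪里 (The Great Wave off Kanagawa), ranked 8th among the prize winners☆18May 3, 2022Updated 4 years ago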
- nuxt3 three.js 3D map large-screen visualization template☆25Jul 19, 2024Updated last year
- ☆49Mar 5, 2025Updated last year
- ☆21Mar 4, 2025Updated last year
- A Chinese-native, multi-level benchmark for evaluating text-to-video generation☆18Jul 8, 2024Updated last year
- Get the media stream from the Dahua/Haikang IPC SDK and demux it into video and audio ES☆13Nov 15, 2015Updated 10 years ago
- ☆22Oct 22, 2024Updated last year
- Node-RED AMQP input and output nodes☆16Nov 27, 2019Updated 6 years ago
- Assessing Context-Aware Creative Intelligence in MLLMs☆23Jul 22, 2025Updated 10 months ago
- HTTP interface testing using JMeter☆11Aug 6, 2020Updated 5 years ago
- Wechaty plugin for integrate your bot with weixin openai-sdk☆13Oct 16, 2022Updated 3 years ago
- 🚦 SpeedTracker API layer☆13Jan 19, 2019Updated 7 years ago