HiThink-Research / GAGEView external linksLinks
General AI evaluation and Gauge Engine. A unified evaluation engine for LLMs, MLLMs, audio, and diffusion models.
☆40Updated this week
Alternatives and similar repositories for GAGE
Users that are interested in GAGE are comparing it to the libraries listed below
Sorting:
- BizFinBench.v2: A Unified Offline–Online Bilingual Benchmark for Expert-Level Financial Capability Evaluation of LLMs☆36Jan 29, 2026Updated 2 weeks ago
- This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.☆19Jul 3, 2025Updated 7 months ago
- SEU Summer School project, based on Kotlin and Java.☆13Sep 15, 2023Updated 2 years ago
- [IJCAI'25 Workshop Oral] The 1st place solution of IJCAI 2025 challenge track 1: Image Detection and Localization☆32Dec 4, 2025Updated 2 months ago
- [EMNLP 2025 Findings] Familiarity-aware Evidence Compression for Retrieval Augmented Generation☆14Aug 20, 2025Updated 5 months ago
- ☆10Feb 20, 2023Updated 2 years ago
- Multi-encoder segmentation for contrail detection in satellite imagery | Google Researc☆11Jan 28, 2026Updated 2 weeks ago
- 智能大幅加速南大LMS智慧教育平台课程进度/ 验证码自动识别/ 一键下载所有课件☆37Jan 8, 2026Updated last month
- ☆12Feb 18, 2025Updated 11 months ago
- ☆15Jan 16, 2024Updated 2 years ago
- ☆13Feb 29, 2024Updated last year
- [ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs"☆16May 24, 2025Updated 8 months ago
- Implementation of Wasserstein Generative Adversarial Networks using Tensorflow☆12Jul 25, 2018Updated 7 years ago
- ☆16Nov 29, 2023Updated 2 years ago
- Code of "A Geometric Perspective on Variational Autoencoders" (NeurIPS 2022)☆14Nov 19, 2024Updated last year
- A Pytorch implementation of Diffusion-Based Probabilistic Uncertainty Estimation for Active Domain Adaptation☆15Nov 28, 2023Updated 2 years ago
- ☆12Feb 26, 2020Updated 5 years ago
- The code repository for "OmniEvalKit: A Modular, Lightweight Toolbox for Evaluating Large Language Model and its Omni-Extensions"☆14Feb 21, 2025Updated 11 months ago
- ☆11Jun 24, 2021Updated 4 years ago
- The API Traffic Research Dataset Framework (ATRDF). Cisco - Ariel University API Security Detection Challenge 2023.☆17Apr 20, 2025Updated 9 months ago
- ☆14Oct 11, 2023Updated 2 years ago
- ☆17Jan 9, 2025Updated last year
- The Code for Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models☆17Oct 4, 2024Updated last year
- ☆19Nov 11, 2024Updated last year
- Official PyTorch implementation of the paper "Utilizing Expert Features for Contrastive Learning of Time-Series Representations"☆14Jan 31, 2023Updated 3 years ago
- A Model Context Protocol (MCP) server that provides hourly and daily weather forecasts using the AccuWeather API.☆31Sep 8, 2025Updated 5 months ago
- 提供图片缓存框架简单思路☆18Feb 22, 2018Updated 7 years ago
- Summer course teamwork: a set of cv tools based on PySide6 and opencv☆14Oct 5, 2023Updated 2 years ago
- Code for "When LLM Meets DRL: Advancing Jailbreaking Efficiency via DRL-guided Search" (NeurIPS 2024)☆17Oct 22, 2024Updated last year
- tushare rust mcp server☆22Apr 27, 2025Updated 9 months ago
- An agent with multiple CUHKSZ campus systems connected.☆17Dec 12, 2024Updated last year
- RAPID: Training-free Retrieval-based Log Anomaly Detection with PLM considering Token-level information☆18Jul 11, 2024Updated last year
- [ASE'23] When Less is Enough: Positive-Unlabeled Learning Model for Vulnerability Detection☆16Jan 12, 2024Updated 2 years ago
- [CVPR 2024] How to Configure Good In-Context Sequence for Visual Question Answering☆21May 28, 2025Updated 8 months ago
- An in-context learning research testbed☆19Mar 16, 2025Updated 10 months ago
- Codes for the paper "Multi-task Hierarchical Adversarial Inverse Reinforcement Learning"☆20May 20, 2023Updated 2 years ago
- Modelling SQL Injection Using Reinforcement Learning☆20Oct 13, 2021Updated 4 years ago
- ☆28Aug 13, 2025Updated 6 months ago
- 基于 skyzh/chicv 制作的简易中文 typst 简历模板 - CV template in Chinese based on skyzh/chicv☆20Oct 12, 2024Updated last year