General AI evaluation and Gauge Engine. A unified evaluation engine for LLMs, MLLMs, audio, and diffusion models.
☆43Mar 22, 2026Updated this week
Alternatives and similar repositories for GAGE
Users that are interested in GAGE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- BizFinBench.v2: A Unified Offline–Online Bilingual Benchmark for Expert-Level Financial Capability Evaluation of LLMs☆40Jan 29, 2026Updated last month
- 提供图片缓存框架简单思路☆18Feb 22, 2018Updated 8 years ago
- A Business-Driven Real-World Financial Benchmark for Evaluating LLMs☆214Jan 9, 2026Updated 2 months ago
- [WACV 2026] PyTorch code for 4D-Animal.☆29Nov 18, 2025Updated 4 months ago
- [EMNLP 2025 Findings] Familiarity-aware Evidence Compression for Retrieval Augmented Generation☆14Aug 20, 2025Updated 7 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Multi-encoder segmentation for contrail detection in satellite imagery | Google Researc☆11Jan 28, 2026Updated last month
- Code of "A Geometric Perspective on Variational Autoencoders" (NeurIPS 2022)☆15Nov 19, 2024Updated last year
- ☆10Feb 20, 2023Updated 3 years ago
- ☆12Feb 26, 2020Updated 6 years ago
- ☆15Jan 16, 2024Updated 2 years ago
- This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.☆23Jul 3, 2025Updated 8 months ago
- ☆14Oct 11, 2023Updated 2 years ago
- A Model Context Protocol (MCP) server that provides hourly and daily weather forecasts using the AccuWeather API.☆32Sep 8, 2025Updated 6 months ago
- ☆28Aug 13, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆12Feb 18, 2025Updated last year
- ☆13Feb 29, 2024Updated 2 years ago
- A Pytorch implementation of Diffusion-Based Probabilistic Uncertainty Estimation for Active Domain Adaptation☆15Nov 28, 2023Updated 2 years ago
- A Model Context Protocol (MCP) server for Caiyun (ColorfulClouds) Weather.☆31Mar 1, 2026Updated 3 weeks ago
- Implementation of Wasserstein Generative Adversarial Networks using Tensorflow☆12Jul 25, 2018Updated 7 years ago
- [IJCAI'25 Workshop Oral] The 1st place solution of IJCAI 2025 challenge track 1: Image Detection and Localization☆35Dec 4, 2025Updated 3 months ago
- [ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs"☆16May 24, 2025Updated 10 months ago
- ☆11Jun 24, 2021Updated 4 years ago
- ☆17Jan 9, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Official PyTorch implementation of the paper "Utilizing Expert Features for Contrastive Learning of Time-Series Representations"☆14Jan 31, 2023Updated 3 years ago
- tushare rust mcp server☆27Apr 27, 2025Updated 11 months ago
- ☆18Nov 29, 2023Updated 2 years ago
- ☆24Mar 24, 2019Updated 7 years ago
- SEU Summer School project, based on Kotlin and Java.☆13Sep 15, 2023Updated 2 years ago
- The Code for Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models☆18Oct 4, 2024Updated last year
- The API Traffic Research Dataset Framework (ATRDF). Cisco - Ariel University API Security Detection Challenge 2023.☆17Apr 20, 2025Updated 11 months ago
- (deprecated) The Trinket Menu addon for WoW, updated for Classic WoW 1.13☆24Feb 14, 2022Updated 4 years ago
- An agent with multiple CUHKSZ campus systems connected.☆17Dec 12, 2024Updated last year
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Codes for the paper "Multi-task Hierarchical Adversarial Inverse Reinforcement Learning"☆19May 20, 2023Updated 2 years ago
- RAPID: Training-free Retrieval-based Log Anomaly Detection with PLM considering Token-level information☆18Jul 11, 2024Updated last year
- [NAACL 2022] Contrastive Learning for Prompt-based Few-shot Language Learners☆22Jan 26, 2023Updated 3 years ago
- The code repository for "OmniEvalKit: A Modular, Lightweight Toolbox for Evaluating Large Language Model and its Omni-Extensions"☆13Feb 21, 2025Updated last year
- Code for "When LLM Meets DRL: Advancing Jailbreaking Efficiency via DRL-guided Search" (NeurIPS 2024)☆18Oct 22, 2024Updated last year
- 基于 skyzh/chicv 制作的简易中文 typst 简历模板 - CV template in Chinese based on skyzh/chicv☆20Oct 12, 2024Updated last year
- Modelling SQL Injection Using Reinforcement Learning☆20Oct 13, 2021Updated 4 years ago