microsoft/MMLU-CF

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/microsoft/MMLU-CF)

microsoft / MMLU-CF

A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025]

☆125

Alternatives and similar repositories for MMLU-CF

Users that are interested in MMLU-CF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Rhythm-Byte / SchemaDiff
View on GitHub
☆246Nov 24, 2024Updated last year
SiyangLi99 / open-alteryx-macro
View on GitHub
Welcome to the 'Open-Alteryx-Macro' project. This project is aimed at providing an open-source solution for managing and updating Alteryx…
☆156May 25, 2024Updated 2 years ago
Nonac / DDOPaI
View on GitHub
☆120Sep 30, 2024Updated last year
shenjunjiekoda / knight
View on GitHub
kight is a static analysis tool for c/c++ programs.
☆213Dec 27, 2024Updated last year
MingXiangL / AttentionShift
View on GitHub
Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation
☆155Oct 18, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
ireneli961111 / data-aggregation-federated-learning
View on GitHub
☆142Nov 13, 2024Updated last year
ZivJia / hmi-workspace
View on GitHub
An Workspace for HMI tools
☆163Jul 11, 2024Updated 2 years ago
MingXiangL / DEVIL
View on GitHub
Evaluation of Text-to-Video Generation Models: A Dynamics Perspective[NeurIPS 2024].
☆274Dec 3, 2024Updated last year
NaishengZhang / book-recommendation-system
View on GitHub
Book Recommendation System
☆234May 2, 2024Updated 2 years ago
yileijin / Bootstrap-GS
View on GitHub
☆251Feb 11, 2025Updated last year
wenlongliaoEE / ETDToolbox
View on GitHub
☆175Feb 21, 2025Updated last year
Nonac / LXD_Build
View on GitHub
This script allows the server to isolate computational resources through LXD and pre-install PyTorch in order to share GPUs among differe…
☆91Apr 13, 2024Updated 2 years ago
vortezwohl / Autono
View on GitHub
A ReAct-Based Highly Robust Autonomous Agent (Harness) Framework.
☆211Jun 23, 2026Updated last month
SSSYDYSSS / TransProPy
View on GitHub
A python package that integrate algorithms and various machine learning approaches to extract features (genes) effective for classificati…
☆251Jan 15, 2026Updated 6 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
EDEAI / NexusAI
View on GitHub
☆190Dec 30, 2025Updated 6 months ago
0voice / mysql_document
View on GitHub
2024年，整理最全面的mysql资料包，含mysql技术文章，paper，面试题，开源项目，电子书
☆205Dec 16, 2024Updated last year
sql-agi / DB-GPT-X
View on GitHub
☆242Jun 16, 2026Updated last month
BiuYeaf / A-general-framework-to-Prompt-tuning-LLM-model
View on GitHub
☆141May 8, 2024Updated 2 years ago
midori-profile / overlay-video
View on GitHub
🎬 This is a high-performance web animation react component with minimal development cost.
☆89Jun 24, 2024Updated 2 years ago
SSSYDYSSS / MetaTrx
View on GitHub
MetaTrx: Comprehensive Cross-Species Transcriptome Analysis
☆118Jun 4, 2024Updated 2 years ago
Falling-dow / Unsupervised-Image-Enhancement-with-CNN-and-GAN
View on GitHub
Advanced Unsupervised Image Enhancement with GAN
☆247Nov 11, 2024Updated last year
SSSYDYSSS / TransProR
View on GitHub
Analysis and visualization of multi-omics data. In ongoing development: multi-modal fusion, sparse learning, and spatio-temporal effects.…
☆206Jan 15, 2026Updated 6 months ago
banggx / morgana-form
View on GitHub
莫甘娜问卷表单编辑器，低代码快速搭建表单，AI表单生成，表单数据搜集统计
☆147Jun 21, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
kaitoInfra / fast-twitter-api
View on GitHub
Simple yet powerful Twitter data retrieval SDK with multi-language support.No Limits, No Auth Required
☆183May 28, 2026Updated last month
sjiang325 / Abdominal-Trauma-Detection-code
View on GitHub
☆134Sep 24, 2024Updated last year
YesuLabs / contracts
View on GitHub
☆98Mar 8, 2025Updated last year
Credit-card-monitoring-and-fraud-check / Credit_card_monitoring_and_check
View on GitHub
A code repository designed to show the best GitHub has to offer.
☆165Jun 30, 2024Updated 2 years ago
jtun-coder / JtunRouter
View on GitHub
It is an Android-based application that enables managing hotspot properties through a web interface, providing mobile routing functionali…
☆156Jul 14, 2026Updated last week
HENGRUIZZZZ / GlucoInsight
View on GitHub
GlucoInsight:Framework for Glucose Management Application
☆83Aug 6, 2024Updated last year
risesoft-y9 / Network-Drive
View on GitHub
网络硬盘是通过存储、分类、检索、分享、协作、下发、回收、展示等方式管理文档、文件、图片、音频、视频等资料的工具。网络硬盘擅长在国产的私有化环境中管控文档权限、存储空间分配、安全加密、链接分享，同时支持一定轻量级的文件任务收发。网络硬盘需要依赖开源的数字底座进行人员岗位管控。
☆355Updated this week
SKHon / diudiu
View on GitHub
一个轻量的企业级BFF框架，集成xprofiler能力，可直接使用其强大的监控告警能力。
☆265Feb 7, 2024Updated 2 years ago
johngai19 / TextDistiller
View on GitHub
AI-powered document summarization engine that transforms lengthy texts into crystallized insights
☆146Nov 5, 2024Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
gersteinlab / ML-Bench
View on GitHub
ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098…
☆315Jul 31, 2025Updated 11 months ago
witcherofresearch / Forgedit
View on GitHub
☆284Jul 6, 2024Updated 2 years ago
Davion-Liu / Awesome-Robustness-in-Information-Retrieval
View on GitHub
A curated list of awesome papers related to adversarial attacks and defenses for information retrieval. If I missed any papers, feel free…
☆220Jul 11, 2024Updated 2 years ago
ProjectNeura / LEADS
View on GitHub
Enable your racing car with powerful, data-driven instrumentation, control, and analysis systems, all wrapped up in a gorgeous look.
☆264Jul 9, 2026Updated 2 weeks ago
CGCL-codes / YiTu
View on GitHub
YiTu is an easy-to-use runtime to fully exploit the hybrid parallelism of different hardwares (e.g., GPU) to efficiently support the exec…
☆254Jan 7, 2026Updated 6 months ago
orchain / prysm
View on GitHub
☆296Sep 14, 2025Updated 10 months ago
conflow-dev / ConFlow
View on GitHub
☆230Jun 9, 2025Updated last year