A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025]
☆125May 17, 2025Updated 10 months ago
Alternatives and similar repositories for MMLU-CF
Users that are interested in MMLU-CF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆246Nov 24, 2024Updated last year
- Welcome to the 'Open-Alteryx-Macro' project. This project is aimed at providing an open-source solution for managing and updating Alteryx…☆156May 25, 2024Updated last year
- ☆120Sep 30, 2024Updated last year
- kight is a static analysis tool for c/c++ programs.☆213Dec 27, 2024Updated last year
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation☆155Oct 18, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆142Nov 13, 2024Updated last year
- An Workspace for HMI tools☆163Jul 11, 2024Updated last year
- Evaluation of Text-to-Video Generation Models: A Dynamics Perspective[NeurIPS 2024].☆274Dec 3, 2024Updated last year
- Book Recommendation System☆235May 2, 2024Updated last year
- ☆175Feb 21, 2025Updated last year
- ☆251Feb 11, 2025Updated last year
- A ReAct-Based Highly Robust Autonomous Agent (Harness) Framework.☆209Mar 19, 2026Updated 3 weeks ago
- This script allows the server to isolate computational resources through LXD and pre-install PyTorch in order to share GPUs among differe…☆91Apr 13, 2024Updated 2 years ago
- ☆188Dec 30, 2025Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 2024年,整理最全面的mysql资料包,含mysql技术文章,paper,面试题,开源项目,电子书☆200Dec 16, 2024Updated last year
- 🎬 This is a high-performance web animation react component with minimal development cost.☆88Jun 24, 2024Updated last year
- ☆141May 8, 2024Updated last year
- ☆241Jul 5, 2024Updated last year
- Advanced Unsupervised Image Enhancement with GAN☆246Nov 11, 2024Updated last year
- A python package that integrate algorithms and various machine learning approaches to extract features (genes) effective for classificati…☆252Jan 15, 2026Updated 2 months ago
- 莫甘娜问卷表单编辑器,低代码快速搭建表单,AI表单生成,表单数据搜集统计☆147Aug 9, 2024Updated last year
- Simple yet powerful Twitter data retrieval SDK with multi-language support.No Limits, No Auth Required☆183Jan 6, 2025Updated last year
- ☆134Sep 24, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆98Mar 8, 2025Updated last year
- It is an Android-based application that enables managing hotspot properties through a web interface, providing mobile routing functionali…☆156Dec 19, 2024Updated last year
- MetaTrx: Comprehensive Cross-Species Transcriptome Analysis☆118Jun 4, 2024Updated last year
- A code repository designed to show the best GitHub has to offer.☆165Jun 30, 2024Updated last year
- Analysis and visualization of multi-omics data. In ongoing development: multi-modal fusion, sparse learning, and spatio-temporal effects.…☆205Jan 15, 2026Updated 2 months ago
- GlucoInsight:Framework for Glucose Management Application☆84Aug 6, 2024Updated last year
- AI-powered document summarization engine that transforms lengthy texts into crystallized insights☆145Nov 5, 2024Updated last year
- 网络硬盘是通过存储、分类、检索、分享、协作、下发、回收、展示等方式管理文档、文件、图片、音频、视频等资料的工具。网络硬盘擅长在国产的私有化环境中管控文档权限、存储空间分配、安全加密、链接分享,同时支持一定轻量级的文件任务收发。网络硬盘需要依赖开源的数字底座进行人员岗位管控。☆352Updated this week
- 一个轻量的企业级BFF框架,集成xprofiler能力,可直接使用其强大的监控告警能力。☆264Feb 7, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098…☆315Jul 31, 2025Updated 8 months ago
- ☆286Jul 6, 2024Updated last year
- ☆297Sep 14, 2025Updated 6 months ago
- This project features optimized Go language, expert source code, concurrent processing, and industry-best practices.☆142Mar 14, 2023Updated 3 years ago
- A curated list of awesome papers related to adversarial attacks and defenses for information retrieval. If I missed any papers, feel free…☆221Jul 11, 2024Updated last year
- Harnessing the Power of AI to Navigate the Information Age – Uncovering Truth, Promoting Transparency, and Championing Fact-Based Discour…☆147Jun 2, 2023Updated 2 years ago
- YiTu is an easy-to-use runtime to fully exploit the hybrid parallelism of different hardwares (e.g., GPU) to efficiently support the exec…☆254Jan 7, 2026Updated 3 months ago