[ICLR 2025] CodeMMLU Evaluator: A framework for evaluating language models on the CodeMMLU MCQ benchmark.
☆29 · Apr 21, 2025 · Updated 10 months ago
Alternatives and similar repositories for CodeMMLU
Users that are interested in CodeMMLU are comparing it to the libraries listed below
- LibMoE: A Library for Comprehensive Benchmarking Mixture of Experts in Large Language Models · ☆46 · Jan 10, 2026 · Updated last month
- Reference code for TensorFlow · ☆13 · Jul 31, 2019 · Updated 6 years ago
- Sketch Driven Regular Expression Generation. · ☆17 · Apr 26, 2023 · Updated 2 years ago
- A simple, easy-to-customize pipeline for local RAG evaluation. Starter prompts and metric definitions included. · ☆25 · Jan 14, 2026 · Updated last month
- Faster Whisper ASR transcription with CTranslate2 · ☆24 · Oct 25, 2024 · Updated last year
- ☆22 · Jan 3, 2025 · Updated last year
- ☆24 · Jan 19, 2022 · Updated 4 years ago
- ☆23 · Sep 18, 2023 · Updated 2 years ago
- The DPAB-α Benchmark · ☆32 · Jan 15, 2025 · Updated last year
- LLM Benchmark for Code · ☆32 · Aug 6, 2024 · Updated last year
- ☆31 · Nov 14, 2024 · Updated last year
- URS Benchmark: Evaluating LLMs on User Reported Scenarios · ☆30 · May 30, 2025 · Updated 9 months ago
- Rethinking the User Interface of AI · ☆32 · Updated this week
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. · ☆32 · Sep 19, 2025 · Updated 5 months ago
- Training and Benchmarking LLMs for Code Preference. · ☆38 · Nov 15, 2024 · Updated last year
- SimADFuzz: Simulation-Feedback Fuzz Testing for Autonomous Driving Systems · ☆10 · Apr 11, 2025 · Updated 10 months ago
- ☆10 · Aug 7, 2024 · Updated last year
- ☆13 · Nov 5, 2024 · Updated last year
- Code implementation for CoTexT: Multi-task Learning with Code-Text Transformer · ☆36 · Sep 14, 2021 · Updated 4 years ago
- ☆16 · May 13, 2021 · Updated 4 years ago
- LUKSO dApps template in Next.js · ☆11 · Jan 7, 2025 · Updated last year
- An abstractive summarization implementation (for English-language datasets) with Transformer and pointer-generator · ☆12 · Dec 31, 2020 · Updated 5 years ago
- A Redis-compatible in-memory database server written in Rust with MLua-based Lua 5.1 scripting · ☆17 · Nov 28, 2025 · Updated 3 months ago
- A simple PE parser that lists all import and export entries · ☆12 · Oct 11, 2018 · Updated 7 years ago
- Media around Buildbot - images, slides, papers, etc. · ☆13 · Oct 6, 2019 · Updated 6 years ago
- rag-pinecone-ray · ☆11 · Aug 14, 2023 · Updated 2 years ago
- ☆11 · Jul 20, 2021 · Updated 4 years ago
- ☆13 · Jan 23, 2023 · Updated 3 years ago
- A Library for Scaling Mixed-Integer Optimization-Based Machine Learning. · ☆12 · Jun 24, 2024 · Updated last year
- Open Source Multivalue String Database · ☆13 · Feb 16, 2026 · Updated last week
- An attempt at an operating system (C++) · ☆21 · Oct 6, 2025 · Updated 4 months ago
- Study notes for "Python 3 Introduction to Machine Learning: Classic Algorithms and Applications" · ☆11 · Nov 9, 2018 · Updated 7 years ago
- ☆19 · Jan 15, 2026 · Updated last month
- This repo is the artifact of FUEL · ☆13 · Dec 2, 2025 · Updated 2 months ago
- ☆41 · Jan 13, 2023 · Updated 3 years ago
- Streamline on-policy/off-policy distillation workflows in a few lines of code · ☆95 · Updated this week
- Demo of fine-tuning QA models to answer FAQs from cloud providers' documentation · ☆11 · Mar 7, 2023 · Updated 2 years ago
- Agent installed on a node to launch IDA, Bindiff, ... and send results to the server (AutoDiffWeb) · ☆10 · Mar 25, 2016 · Updated 9 years ago
- A real-time speech-to-text diarization system that gathers and interleaves speech from multi-speaker audio. · ☆25 · Jan 29, 2026 · Updated last month