☆31Nov 9, 2024Updated last year
Alternatives and similar repositories for mmlu-redux
Users that are interested in mmlu-redux are comparing it to the libraries listed below
Sorting:
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆76Jan 16, 2026Updated 2 months ago
- 中文原生等级化代码能力测试基准☆15Apr 11, 2024Updated last year
- MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing…☆10Oct 7, 2024Updated last year
- Official github repo for E-Eval, a Chinese K12 education evaluation benchmark for LLMs.☆29Feb 19, 2024Updated 2 years ago
- Repository for "Training Language Models To Explain Their Own Computations"☆21Dec 22, 2025Updated 2 months ago
- The official code of our paper at EMNLP 2022: Back to the Future: Bidirectional Information Decoupling Network for Multi-turn Dialogue Mo…☆16Feb 17, 2023Updated 3 years ago
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality☆236Aug 2, 2024Updated last year
- ☆40Aug 21, 2021Updated 4 years ago
- simulate linkstate algorithm for routing☆10Nov 6, 2023Updated 2 years ago
- Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals☆12May 24, 2024Updated last year
- Use this extension to automate google meet admission.☆11Mar 1, 2021Updated 5 years ago
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated 11 months ago
- ☆11May 18, 2025Updated 10 months ago
- Conceptual Construct Representations☆11Feb 23, 2023Updated 3 years ago
- ScienceMeter: Tracking Scientific Knowledge Updates in Language Models☆17Jun 28, 2025Updated 8 months ago
- [NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"☆43May 22, 2025Updated 9 months ago
- Rahnema Final Project - Network anomaly detection☆11Jul 22, 2021Updated 4 years ago
- Use shecan in bash with ease☆15Feb 8, 2019Updated 7 years ago
- ☆11Mar 12, 2021Updated 5 years ago
- Spectral-Spatial MLP Network with Reciprocal Points learning for Open-Set Hyperspectral Image Classification☆16Jul 9, 2023Updated 2 years ago
- Beyond Known Clusters: Probe New Prototypes for Efficient Generalized Class Discovery☆16Apr 28, 2024Updated last year
- [NAACL 2025 Main Selected Oral] Repository for the paper: Prompt Compression for Large Language Models: A Survey☆36May 18, 2025Updated 10 months ago
- This is a list of Persian foods☆13Oct 1, 2020Updated 5 years ago
- ComfyUI custom node to extend Wan videos in loops with overlap consistency, per loop prompts, and optional LoRA control.☆25Nov 29, 2025Updated 3 months ago
- Implementation of Differential Learning Rate in Keras☆11Jun 4, 2019Updated 6 years ago
- 蚂蚁金融自然语言处理竞赛。☆10Sep 3, 2018Updated 7 years ago
- Resolving Knowledge Conflicts in Large Language Models, COLM 2024☆18Oct 7, 2025Updated 5 months ago
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systems☆35Nov 18, 2025Updated 4 months ago
- CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation☆14Aug 19, 2025Updated 7 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 6 months ago
- An original implementation of the paper "CREPE: Open-Domain Question Answering with False Presuppositions"☆16Nov 5, 2024Updated last year
- extension for text WebUI☆20Aug 7, 2025Updated 7 months ago
- Official code of "The Automated but Risky Game: Modeling Agent-to-Agent Negotiations and Transactions in Consumer Markets"☆24Sep 20, 2025Updated 5 months ago
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and…☆65May 16, 2025Updated 10 months ago
- Normalized Wasserstein for Mixture Distributions☆11Mar 24, 2023Updated 2 years ago
- A Python module for mapping multiple high-dimensional datasets into a common low-dimensional space.☆10Mar 29, 2018Updated 7 years ago
- This is the source code for "Dream On". An indie game planned to be released in Fall 2021.☆10Aug 19, 2021Updated 4 years ago
- ☆11Oct 17, 2024Updated last year
- Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination"☆28Dec 18, 2024Updated last year