☆31Nov 9, 2024Updated last year
Alternatives and similar repositories for mmlu-redux
Users that are interested in mmlu-redux are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆79Jan 16, 2026Updated 2 months ago
- 中文原生等级化代码能力测试基准☆15Apr 11, 2024Updated last year
- MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing…☆10Oct 7, 2024Updated last year
- Repository for "Training Language Models To Explain Their Own Computations"☆21Dec 22, 2025Updated 3 months ago
- The official code of our paper at EMNLP 2022: Back to the Future: Bidirectional Information Decoupling Network for Multi-turn Dialogue Mo…☆16Feb 17, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality☆236Aug 2, 2024Updated last year
- simulate linkstate algorithm for routing☆10Nov 6, 2023Updated 2 years ago
- Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals☆12May 24, 2024Updated last year
- This repository provides the code for the methods and experiments presented in our paper 'Dual-stream Class-adaptive Network for Semi-sup…☆11Feb 29, 2024Updated 2 years ago
- Use this extension to automate google meet admission.☆11Mar 1, 2021Updated 5 years ago
- ☆11May 18, 2025Updated 10 months ago
- Official repository for ODQA experiments from Decomposed Prompting: A Modular Approach for Solving Complex Tasks, ICLR23☆12Jul 28, 2023Updated 2 years ago
- Codebase for character-centric story understanding☆14Jan 20, 2022Updated 4 years ago
- Conceptual Construct Representations☆11Feb 23, 2023Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ScienceMeter: Tracking Scientific Knowledge Updates in Language Models☆17Jun 28, 2025Updated 9 months ago
- Rahnema Final Project - Network anomaly detection☆11Jul 22, 2021Updated 4 years ago
- compiler project for compiler course (spring 99) in sbu university☆13Nov 21, 2023Updated 2 years ago
- ☆11Mar 12, 2021Updated 5 years ago
- Spectral-Spatial MLP Network with Reciprocal Points learning for Open-Set Hyperspectral Image Classification☆16Jul 9, 2023Updated 2 years ago
- ☆20Mar 5, 2024Updated 2 years ago
- ☆77Jan 24, 2025Updated last year
- [NAACL 2025 Main Selected Oral] Repository for the paper: Prompt Compression for Large Language Models: A Survey☆36May 18, 2025Updated 10 months ago
- Implementation of Differential Learning Rate in Keras☆11Jun 4, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 蚂蚁金融自然语言处理竞赛。☆10Sep 3, 2018Updated 7 years ago
- Resolving Knowledge Conflicts in Large Language Models, COLM 2024☆18Oct 7, 2025Updated 6 months ago
- Helm chart for tile38☆15Mar 30, 2026Updated last week
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systems☆36Nov 18, 2025Updated 4 months ago
- CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation☆14Aug 19, 2025Updated 7 months ago
- Code for the paper "Spectral Editing of Activations for Large Language Model Alignments"☆29Dec 20, 2024Updated last year
- An original implementation of the paper "CREPE: Open-Domain Question Answering with False Presuppositions"☆16Nov 5, 2024Updated last year
- extension for text WebUI☆20Aug 7, 2025Updated 8 months ago
- Official code of "The Automated but Risky Game: Modeling Agent-to-Agent Negotiations and Transactions in Consumer Markets"☆25Mar 24, 2026Updated 2 weeks ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and…☆66May 16, 2025Updated 10 months ago
- The code of the paper "DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects"☆19May 2, 2025Updated 11 months ago
- Caffe/Neon prototxt training file for our Neurocomputing2017 work: Fuzzy Quantitative Deep Compression Network☆12May 30, 2018Updated 7 years ago
- Normalized Wasserstein for Mixture Distributions☆11Mar 24, 2023Updated 3 years ago
- ☆20May 5, 2023Updated 2 years ago
- [CVPR'24] Solving the Catastrophic Forgetting Problem in Generalized Category Discovery https://arxiv.org/pdf/2501.05272☆16Dec 24, 2024Updated last year
- This is the source code for "Dream On". An indie game planned to be released in Fall 2021.☆10Aug 19, 2021Updated 4 years ago