Repository for the paper: Revisiting LLM Evaluation through Mechanism Interpretability: a New Metric and Model Utility Law
☆12Aug 16, 2025Updated 7 months ago
Alternatives and similar repositories for MUI-Eval
Users that are interested in MUI-Eval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Visual Storytelling API☆36Feb 11, 2017Updated 9 years ago
- USTC OSH 2023 course homepage☆13Jul 27, 2023Updated 2 years ago
- Source code for EMNLP2022 paper "Finding Skill Neurons in Pre-trained Transformers via Prompt Tuning".☆18Mar 13, 2023Updated 3 years ago
- Code for "In-Context Former: Lightning-fast Compressing Context for Large Language Model" (Findings of EMNLP 2024)☆21Nov 21, 2024Updated last year
- [2025 ACL Findings] Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization☆25Oct 29, 2025Updated 4 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)☆25Dec 1, 2024Updated last year
- ☆15Aug 14, 2025Updated 7 months ago
- ☆14Jul 9, 2018Updated 7 years ago
- Fine-grained named entity recognition using BERT☆11Feb 5, 2020Updated 6 years ago
- 监控合肥工业大学宣城校区官网通知变化情况,并发送邮件进行通知☆10Jun 1, 2021Updated 4 years ago
- 人工智能:爬山法、随机重启爬山法、模拟退火算法、遗传算法、启发式搜索方法解决八数码和八皇后问题☆11Jul 15, 2021Updated 4 years ago
- A proof-of-concept rec.ustc.edu.cn client☆15Dec 25, 2023Updated 2 years ago
- The datasets and source code of the NDSS 2025 paper《BinEnhance: An Enhancement Framework Based on External Environment Semantics for Bina…☆30Nov 13, 2025Updated 4 months ago
- [COLM 2025] EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees☆31Jul 11, 2025Updated 8 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following☆137Jul 8, 2024Updated last year
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.☆25Jan 23, 2024Updated 2 years ago
- Metis-RISE: RL Incentivizes and SFT Enhances Multimodal Reasoning Model Learning☆22Jun 26, 2025Updated 9 months ago
- JSSP dataset for LLMs☆16May 29, 2025Updated 9 months ago
- ☆43Feb 22, 2026Updated last month
- ☆21Aug 3, 2021Updated 4 years ago
- Self Tuned Openwrt for NanoPi R2S☆11May 11, 2025Updated 10 months ago
- Using some AI tools to auto-play Magicraft game.☆63Dec 26, 2025Updated 3 months ago
- A paper list of research conducted based on wikiHow☆27Mar 5, 2022Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Test suite for seL4.☆30Mar 16, 2026Updated last week
- Solution Markdown Template For Algorithm Contest☆30Sep 22, 2024Updated last year
- [ECCV 2024] "Prediction Exposes Your Face: Black-box Model Inversion via Prediction Alignment"☆15Mar 12, 2025Updated last year
- 课程主页☆32Jul 16, 2020Updated 5 years ago
- a basic jvm☆12Jan 22, 2018Updated 8 years ago
- On Predictability of Reinforcement Learning Dynamics for Large Language Models (ICLR 2026)☆155Jan 27, 2026Updated 2 months ago
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684☆47Oct 20, 2025Updated 5 months ago
- parse_type extends the "parse" module (opposite of "string.format()")☆20Aug 11, 2025Updated 7 months ago
- Source code for IJCKG 2021 paper "FedE: Embedding Knowledge Graphs in Federated Setting"☆25Apr 15, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Multimodal entity linking for Tweets☆29Aug 30, 2021Updated 4 years ago
- Hardware-centric Linux kernel debloater☆15Nov 28, 2023Updated 2 years ago
- 学习过程中积累的一些笔记☆33Mar 9, 2023Updated 3 years ago
- ☆12Nov 30, 2018Updated 7 years ago
- Korea Girls High School's Inappropriate Uniform Detection Service☆16Apr 5, 2022Updated 3 years ago
- gRPC service for Zhejiang University Intl Campus.☆12Feb 17, 2019Updated 7 years ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆51Nov 9, 2024Updated last year