ALEX-nlp / MUI-Eval
Repository for the paper: Revisiting LLM Evaluation through Mechanism Interpretability: a New Metric and Model Utility Law
☆5Updated last week
Alternatives and similar repositories for MUI-Eval
Users who are interested in MUI-Eval are comparing it to the repositories listed below
- Related work and background techniques for OpenAI o1☆221Updated 5 months ago
- Neural Code Intelligence Survey 2024; Reading lists and resources☆260Updated 2 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆221Updated this week
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆631Updated 4 months ago
- Awesome RL-based LLM Reasoning☆511Updated last month
- Paper List for In-context Learning 🌷☆183Updated last year
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning☆453Updated 7 months ago
- The awesome agents in the era of large language models☆64Updated last year
- Yelp Simulator for WWW'25 AgentSociety Challenge☆80Updated last month
- ☆540Updated 5 months ago
- [ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future☆451Updated 4 months ago
- ☆13Updated last year
- Paper list for Efficient Reasoning.☆467Updated last week
- ☆343Updated 2 months ago
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)☆250Updated last month
- A series of technical reports on Slow Thinking with LLMs☆685Updated this week
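- EasyOffer (a collection of LLM interview experiences) is a summer-internship offer guide tailored to LLM newcomers. It mainly records common big-tech live-coding problems, interview experiences, and open-ended questions encountered while preparing for LLM summer internships and autumn recruitment. The author is a beginner who is still learning and welcomes corrections, and hopes everyone lands their desired Of…☆231Updated 2 months ago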
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆123Updated 8 months ago
- This is the repo for the survey of LLM4IR.☆484Updated 9 months ago
- SEA is an automated paper review framework capable of generating comprehensive and high-quality review feedback with high consistency for…☆71Updated 6 months ago
- An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models☆626Updated this week
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆531Updated last week
- Awesome RL Reasoning Recipes ("Triple R")☆605Updated this week
- LLM hallucination paper list☆316Updated last year
- LEAP is an end-to-end library designed to support social science research by automatically analyzing user-collected unstructured data in …☆15Updated 4 months ago
- classification and solutions for PKU-CSSummerCamp-OnlineJudge☆17Updated last year
- Large Language Models (LLMs) of Code☆18Updated 2 years ago
- ☆56Updated 3 months ago
- Paper list on LLMs and multimodal LLMs☆40Updated last week
- Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models☆424Updated 2 weeks ago