allenai / understanding_mcqaLinks
Code for the arXiv preprint "Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions"
☆16Updated 4 months ago
Alternatives and similar repositories for understanding_mcqa
Users that are interested in understanding_mcqa are comparing it to the libraries listed below
Sorting:
- ☆84Updated 10 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆43Updated last year
- This repository contains the code and data for the paper "SelfIE: Self-Interpretation of Large Language Model Embeddings" by Haozhe Chen,…☆54Updated last year
- ☆75Updated last year
- [COLM 2025] EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees☆28Updated 5 months ago
- Evaluating the Moral Beliefs Encoded in LLMs☆31Updated 11 months ago
- Exploring the Limitations of Large Language Models on Multi-Hop Queries☆29Updated 9 months ago
- [ACL 2025] Knowledge Unlearning for Large Language Models☆47Updated 2 months ago
- ☆22Updated 11 months ago
- ⚓️ Repository for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.☆94Updated last month
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆67Updated last year
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆37Updated 11 months ago
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆151Updated 5 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆112Updated 4 months ago
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers☆159Updated 3 weeks ago
- Steering vectors for transformer language models in Pytorch / Huggingface☆130Updated 9 months ago
- The Prism Alignment Project☆86Updated last year
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIX- TURE OF THOUGHT REPRESENTATIONS FOR COST- EFFICIENT REA…☆28Updated last year
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆40Updated 2 years ago
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.☆44Updated 8 months ago
- ☆106Updated 7 months ago
- Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration"☆41Updated last year
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆142Updated last month
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023☆36Updated 2 years ago
- ☆53Updated last year
- Performant framework for training, analyzing and visualizing Sparse Autoencoders (SAEs) and their frontier variants.☆168Updated this week
- The official repo for the code and data of paper SMART☆37Updated 9 months ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆193Updated 9 months ago
- ☆105Updated last year
- [EMNLP 2024] A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners☆26Updated last year