allenai / understanding_mcqaLinks
Code for the arXiv preprint "Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions"
☆16Updated 2 months ago
Alternatives and similar repositories for understanding_mcqa
Users that are interested in understanding_mcqa are comparing it to the libraries listed below
Sorting:
- Evaluating the Moral Beliefs Encoded in LLMs☆31Updated 10 months ago
- ⚓️ Repository for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.☆88Updated this week
- [COLM 2025] EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees☆25Updated 3 months ago
- ☆78Updated 9 months ago
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆136Updated 4 months ago
- The official repo for the code and data of paper SMART☆36Updated 8 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆108Updated 3 months ago
- ☆195Updated 6 months ago
- ☆29Updated last year
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans.☆103Updated last week
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆190Updated 8 months ago
- This repository contains the code and data for the paper "SelfIE: Self-Interpretation of Large Language Model Embeddings" by Haozhe Chen,…☆52Updated 10 months ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆95Updated 2 years ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆63Updated 10 months ago
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers☆159Updated 8 months ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆135Updated 2 weeks ago
- ☆102Updated 11 months ago
- official implementation of paper "Process Reward Model with Q-value Rankings"☆64Updated 8 months ago
- For OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research.☆158Updated this week
- [EMNLP '23] Discriminator-Guided Chain-of-Thought Reasoning☆49Updated last year
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023☆36Updated last year
- ☆30Updated last year
- ☆48Updated 6 months ago
- Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory☆159Updated 5 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆109Updated last year
- [ACL 2025] Knowledge Unlearning for Large Language Models☆45Updated last month
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆37Updated last year
- Function Vectors in Large Language Models (ICLR 2024)☆181Updated 6 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆43Updated 8 months ago