edenbiran / HoppingTooLateView external linksLinks
Exploring the Limitations of Large Language Models on Multi-Hop Queries
☆32Mar 2, 2025Updated 11 months ago
Alternatives and similar repositories for HoppingTooLate
Users that are interested in HoppingTooLate are comparing it to the libraries listed below
Sorting:
- ☆70Mar 6, 2025Updated 11 months ago
- Official implementation of "MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model". Our co…☆25Dec 20, 2024Updated last year
- ☆10Nov 6, 2024Updated last year
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- [ACL'2024 Findings] "Understanding and Patching Compositional Reasoning in LLMs"☆13Aug 28, 2024Updated last year
- Attribution-based Parameter Decomposition☆33Jun 11, 2025Updated 8 months ago
- A benchmark for mechanistic discovery of circuits in Transformers☆16Dec 15, 2024Updated last year
- A Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation, Levy et al., Findings of EMNLP 2021☆14Apr 3, 2022Updated 3 years ago
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.☆18Apr 25, 2021Updated 4 years ago
- ☆39Jun 11, 2025Updated 8 months ago
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…☆23Dec 4, 2024Updated last year
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders"☆24Feb 6, 2025Updated last year
- Evaluate interpretability methods on localizing and disentangling concepts in LLMs.☆57Oct 30, 2025Updated 3 months ago
- Exploring Few-Shot Adaptation of Language Models with Tables☆24Aug 22, 2022Updated 3 years ago
- Open source replication of Anthropic's Crosscoders for Model Diffing☆64Oct 27, 2024Updated last year
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆72Jan 16, 2026Updated last month
- Code and data for "Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?" (ACL 2024)☆32Jul 3, 2024Updated last year
- ☆70Jun 18, 2025Updated 7 months ago
- Course Materials for Interpretability of Large Language Models (0368.4264) at Tel Aviv University☆297Feb 8, 2026Updated last week
- The official repository containing the source code to the explAIner publication.☆32Apr 29, 2024Updated last year
- ☆29Apr 30, 2024Updated last year
- This repository collects all relevant resources about interpretability in LLMs☆387Nov 1, 2024Updated last year
- A library for efficient patching and automatic circuit discovery.☆88Dec 31, 2025Updated last month
- Methods and evaluation for aligning language models temporally☆30Mar 2, 2024Updated last year
- Measuring the situational awareness of language models☆40Feb 12, 2024Updated 2 years ago
- ☆83Feb 25, 2025Updated 11 months ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆235Jul 19, 2025Updated 6 months ago
- 🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models☆12May 30, 2025Updated 8 months ago
- Build an AI bot in Discord to serve user's personalized reports on what's up in tech☆28Sep 14, 2025Updated 5 months ago
- Training code for Sparse Autoencoders on Embedding models☆39Feb 27, 2025Updated 11 months ago
- Code for my NeurIPS 2024 ATTRIB paper titled "Attribution Patching Outperforms Automated Circuit Discovery"☆47May 31, 2024Updated last year
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers☆163Nov 14, 2025Updated 3 months ago
- GPTSiteCrawler☆13Nov 20, 2023Updated 2 years ago
- A Python script that saves your CrewAI agents crew output to a Notion Database☆14Feb 17, 2024Updated last year
- Evaluation Pipeline for medical tasks.☆12Updated this week
- The main controller for services in the cs-insights project through docker-compose.☆13Aug 25, 2023Updated 2 years ago
- my profile readme☆14Updated this week
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch☆10Aug 7, 2024Updated last year
- [NeurIPS 2024] How do Large Language Models Handle Multilingualism?☆51Nov 8, 2024Updated last year