technion-cs-nlp / parametric-faithfulnessLinks
β16Updated 5 months ago
Alternatives and similar repositories for parametric-faithfulness
Users that are interested in parametric-faithfulness are comparing it to the libraries listed below
Sorting:
- Landing page for MIB: A Mechanistic Interpretability Benchmarkβ24Updated 5 months ago
- πͺPISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Modelsβ12Updated 8 months ago
- Measuring the Mixing of Contextual Information in the Transformerβ34Updated 2 years ago
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/β26Updated 10 months ago
- The geometry of multilingual language model representations (EMNLP 2022).β22Updated 3 years ago
- Attribute statements generated by LLMs to preceding tokens using attention weights.β21Updated 9 months ago
- Evaluate interpretability methods on localizing and disentangling concepts in LLMs.β57Updated 3 months ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data"β39Updated 3 years ago
- Find informative examples to efficiently (human)-evaluate NLG models.β17Updated last week
- Data for evaluating gender bias in coreference resolution systems.β81Updated 6 years ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.β42Updated 2 years ago
- β47Updated 2 years ago
- DecompX: Explaining Transformers Decisions by Propagating Token Decomposition [ACL 2023]β19Updated 6 months ago
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 π πβ15Updated last year
- β116Updated last year
- β32Updated 11 months ago
- Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the pβ¦β12Updated last year
- β24Updated 4 years ago
- A software for transferring pre-trained English models to foreign languagesβ19Updated 2 years ago
- β17Updated 7 months ago
- This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP".β89Updated 4 years ago
- Code for preprint: Summarizing Differences between Text Distributions with Natural Languageβ43Updated 2 years ago
- Highlight errors in a bib file: missing URLs, capitalization protection, etcβ27Updated last year
- NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformersβ42Updated 11 months ago
- β29Updated last year
- β18Updated 3 years ago
- Benchmark API for Multidomain Language Modelingβ25Updated 3 years ago
- Sparse probing paper full code.β66Updated 2 years ago
- This repository accompanies our paper βDo Prompt-Based Models Really Understand the Meaning of Their Prompts?ββ85Updated 3 years ago
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paperβ85Updated 4 years ago