yoavgur / PISCESLinks
πͺPISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models
β11Updated 6 months ago
Alternatives and similar repositories for PISCES
Users that are interested in PISCES are comparing it to the libraries listed below
Sorting:
- Find informative examples to efficiently (human)-evaluate NLG models.β17Updated last week
- Measuring the Mixing of Contextual Information in the Transformerβ33Updated 2 years ago
- Attribute statements generated by LLMs to preceding tokens using attention weights.β19Updated 7 months ago
- β29Updated last year
- β39Updated 4 years ago
- [NeurIPS 2025 MechInterp Workshop - Spotlight] Official implementation of the paper "RelP: Faithful and Efficient Circuit Discovery in Laβ¦β21Updated last month
- β90Updated 3 years ago
- The geometry of multilingual language model representations (EMNLP 2022).β22Updated 3 years ago
- Repository for DEMETR: Diagnosing Evaluation Metrics for Translationβ17Updated 3 years ago
- Code and data for the NAACL 2021 paper: "XFORMAL: A Benchmark for Multilingual Formality Style Transfer"β12Updated 4 years ago
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/β25Updated 9 months ago
- https://arxiv.org/abs/2404.10917β14Updated 8 months ago
- β47Updated last year
- Codes for "Benchmarking the Generation of Fact Checking Explanations"β10Updated last year
- Debiasing Methods in Natural Language Understanding Make Bias More Accessible:Β Code and Dataβ14Updated 3 years ago
- A Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation, Levy et al., Findings of EMNLP 2021β14Updated 3 years ago
- A software for transferring pre-trained English models to foreign languagesβ19Updated 2 years ago
- This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP".β88Updated 4 years ago
- β15Updated 4 years ago
- β24Updated 4 years ago
- DecompX: Explaining Transformers Decisions by Propagating Token Decomposition [ACL 2023]β18Updated 5 months ago
- β113Updated 3 years ago
- A framework for evaluating Machine Translation models.β11Updated 6 months ago
- Landing page for MIB: A Mechanistic Interpretability Benchmarkβ21Updated 4 months ago
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 π πβ15Updated last year
- β58Updated 3 years ago
- [NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformersβ21Updated 2 years ago
- Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper heβ¦β27Updated 4 months ago
- β17Updated 3 years ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.β42Updated 2 years ago