technion-cs-nlp/parametric-faithfulness

☆23

Alternatives and similar repositories for parametric-faithfulness

Users who are interested in parametric-faithfulness compare it to the repositories listed below.

Roxot / mbr-nmt
Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation
☆16 · Oct 14, 2022 · Updated 3 years ago
Heidelberg-NLP / CC-SHAP
Code for "On Measuring Faithfulness of Natural Language Explanations"
☆23 · Jul 14, 2026 · Updated 2 weeks ago
FarnoushRJ / RelP
[NeurIPS 2025 MechInterp Workshop - Spotlight] Official implementation of the paper "RelP: Faithful and Efficient Circuit Discovery in La…
☆29 · Nov 3, 2025 · Updated 8 months ago
zouharvi / subset2evaluate
Find informative examples to efficiently (human)-evaluate NLG models.
☆17 · Apr 22, 2026 · Updated 3 months ago
rycolab / kl-rb
This repository contains code for the paper "Better Estimation of the KL Divergence Between Language Models"
☆19 · May 30, 2025 · Updated last year
yoavgur / PISCES
🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models
☆13 · Jun 28, 2026 · Updated last month
xiye17 / EvalQAExpl
Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.
☆17 · Apr 25, 2021 · Updated 5 years ago
TransluceAI / introspective-interp
Repository for "Training Language Models To Explain Their Own Computations"
☆23 · Jul 7, 2026 · Updated 3 weeks ago
matchten / LoRA-Models-for-SAEs
Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"
☆17 · Mar 31, 2025 · Updated last year
jqueeney / robust-safe-rl
Robust and safe deep reinforcement learning algorithms
☆17 · Mar 27, 2024 · Updated 2 years ago
Leukas / CUTE
☆20 · Apr 26, 2026 · Updated 3 months ago
goodfire-ai / scribe-task-suite
A suite of interpretability tasks to evaluate agents using Scribe for notebook access
☆18 · Oct 2, 2025 · Updated 9 months ago
lacoco-lab / decompiling_transformers
Repo for Paper: Discovering Interpretable Algorithms by Decompiling Transformers to RASP
☆15 · May 25, 2026 · Updated 2 months ago
mohsenfayyaz / DecompX
DecompX: Explaining Transformers Decisions by Propagating Token Decomposition [ACL 2023]
☆19 · Jul 3, 2025 · Updated last year
mohsenfayyaz / GlobEnc
[NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers
☆21 · May 16, 2023 · Updated 3 years ago
MurrayTom / SG-Bench
SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types
☆26 · Nov 29, 2024 · Updated last year
INK-USC / expl-refinement
Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)
☆11 · Oct 25, 2021 · Updated 4 years ago
Betswish / MIRAGE
Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/
☆25 · Mar 10, 2025 · Updated last year
probabll / dists.pt
Extensions to torch distributions
☆19 · Apr 22, 2022 · Updated 4 years ago
azinmatin / elixir
ELIXIR: Learning from User Feedback on Explanations to Improve Recommender Models
☆10 · Feb 15, 2021 · Updated 5 years ago
successar / FRESH
☆26 · Jun 12, 2023 · Updated 3 years ago
ZurichNLP / mbr
Minimum Bayes Risk Decoding for Hugging Face Transformers
☆61 · Jun 3, 2024 · Updated 2 years ago
EleutherAI / tokengrams
Efficiently computing & storing token n-grams from large corpora
☆28 · Jun 15, 2026 · Updated last month
HITsz-TMG / Ext-Sub
Official implementation of our paper "Separate the Wheat from the Chaff: Model Deficiency Unlearning via Parameter-Efficient Module Opera…
☆12 · Sep 20, 2024 · Updated last year
aaronmueller / MIB
Landing page for MIB: A Mechanistic Interpretability Benchmark
☆26 · Aug 15, 2025 · Updated 11 months ago
MadryLab / AT2
Attribute statements generated by LLMs to preceding tokens using attention weights.
☆28 · Apr 22, 2025 · Updated last year
jacobkrantz / VertMetric
VertMetric: An abstractive summarization evaluation package. VERT stands for Versatile Evaluation of Reduced Texts.
☆12 · Dec 20, 2018 · Updated 7 years ago
hannamw / EAP-IG
☆84 · May 23, 2026 · Updated 2 months ago
jvladika / HealthFC
HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checking
☆14 · Apr 11, 2025 · Updated last year
suntea233 / DualLoRA
Implementation of ACL 2024 paper "Zero-Shot Cross-Domain Dialogue State Tracking via Dual Low-Rank Adaptation".
☆15 · Nov 9, 2024 · Updated last year
mt-upc / transformer-contributions
Measuring the Mixing of Contextual Information in the Transformer
☆35 · May 27, 2023 · Updated 3 years ago
ekinakyurek / influence
Code for "Tracing Knowledge in Language Models Back to the Training Data"
☆40 · Dec 27, 2022 · Updated 3 years ago
SakanaAI / TransEvalnia
Reasoning-based Evaluation and Ranking of Translations.
☆21 · Jun 2, 2026 · Updated last month
keyonvafa / sequential-rationales
Rationales for Sequential Predictions
☆39 · Mar 10, 2022 · Updated 4 years ago
tilde-research / activault
Engine for collecting, uploading, and downloading model activations
☆30 · Apr 2, 2025 · Updated last year
anthropics / headvis
Head Vis Public Release
☆39 · May 4, 2026 · Updated 2 months ago
sleepinyourhat / quora-duplicate-questions-util
Converts Quora's new NLU dataset to SNLI txt/jsonl format, plus test/dev split, tokenization.
☆14 · Jan 27, 2017 · Updated 9 years ago
Heidelberg-NLP / CC-SHAP-VLM
Official code implementation for the paper "Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Expl…
☆12 · Jul 14, 2026 · Updated 2 weeks ago
cadentj / caft
☆25 · Mar 30, 2026 · Updated 3 months ago