yoavgur/PISCES

🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models

☆13

Alternatives and similar repositories for PISCES

Users who are interested in PISCES are comparing it to the repositories listed below.

xiye17 / EvalQAExpl
Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.
☆17 · Apr 25, 2021 · Updated 5 years ago
visinf / fast-axiomatic-attribution
Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021)
☆15 · Feb 24, 2026 · Updated 4 months ago
zouharvi / subset2evaluate
Find informative examples to efficiently (human)-evaluate NLG models.
☆17 · Apr 22, 2026 · Updated 2 months ago
mt-upc / transformer-contributions-nmt
☆18 · Oct 6, 2022 · Updated 3 years ago
HiLab-git / LN-Seg-FM
☆14 · Dec 22, 2025 · Updated 6 months ago
Leukas / CUTE
☆20 · Apr 26, 2026 · Updated 2 months ago
mitvis / saliency-cards
Saliency Cards are transparency documentation for saliency methods. Learn about new saliency methods or document your own!
☆19 · Jun 9, 2023 · Updated 3 years ago
mohsenfayyaz / DecompX
DecompX: Explaining Transformers Decisions by Propagating Token Decomposition [ACL 2023]
☆19 · Jul 3, 2025 · Updated last year
mohsenfayyaz / GlobEnc
[NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers
☆21 · May 16, 2023 · Updated 3 years ago
INK-USC / expl-refinement
Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)
☆11 · Oct 25, 2021 · Updated 4 years ago
Betswish / MIRAGE
Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/
☆25 · Mar 10, 2025 · Updated last year
GChrysostomou / ood_faith
☆13 · Jul 26, 2023 · Updated 2 years ago
yonatan-mitmit / onium
Extension injector into Electron apps
☆23 · Aug 10, 2023 · Updated 2 years ago
allenai / few_shot_explanations
Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"
☆29 · Apr 28, 2023 · Updated 3 years ago
technion-cs-nlp / parametric-faithfulness
☆23 · Aug 30, 2025 · Updated 10 months ago
MadryLab / AT2
Attribute statements generated by LLMs to preceding tokens using attention weights.
☆28 · Apr 22, 2025 · Updated last year
TransluceAI / circuits
ADAG: Transluce's MLP neuron-level circuit tracing library
☆33 · Apr 10, 2026 · Updated 3 months ago
jacobkrantz / VertMetric
VertMetric: An abstractive summarization evaluation package. VERT stands for Versatile Evaluation of Reduced Texts.
☆12 · Dec 20, 2018 · Updated 7 years ago
ZBox1005 / AgentForesight
AgentForesight: Online Auditing for Early Failure Prediction in Multi-Agent Systems
☆16 · May 12, 2026 · Updated 2 months ago
HKUST-KnowComp / PrivLM-Bench
Code for ACL 2024 paper: PrivLM-Bench: A Multi-level Privacy Evaluation Benchmark for Language Models.
☆16 · Feb 5, 2025 · Updated last year
davidenitti / ML
☆11 · Nov 13, 2021 · Updated 4 years ago
jvladika / HealthFC
HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checking
☆14 · Apr 11, 2025 · Updated last year
mt-upc / transformer-contributions
Measuring the Mixing of Contextual Information in the Transformer
☆35 · May 27, 2023 · Updated 3 years ago
ekinakyurek / influence
Code for "Tracing Knowledge in Language Models Back to the Training Data"
☆40 · Dec 27, 2022 · Updated 3 years ago
yingqichao / imuge_plus
☆19 · Oct 10, 2024 · Updated last year
yihuaihong / ConceptVectors
[EMNLP 2025 Main] ConceptVectors Benchmark and Code for the paper "Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces"
☆40 · Aug 20, 2025 · Updated 11 months ago
Sahardastani / spectral_vmamba
[CVPR 2025] Spectral State Space Model for Rotation-Invariant Visual Representation Learning
☆18 · Oct 13, 2025 · Updated 9 months ago
FarnoushRJ / RelP
[NeurIPS 2025 MechInterp Workshop - Spotlight] Official implementation of the paper "RelP: Faithful and Efficient Circuit Discovery in La…
☆29 · Nov 3, 2025 · Updated 8 months ago
keyonvafa / sequential-rationales
Rationales for Sequential Predictions
☆39 · Mar 10, 2022 · Updated 4 years ago
milesaturpin / cot-unfaithfulness
☆57 · Oct 23, 2023 · Updated 2 years ago
anthropics / headvis
Head Vis Public Release
☆39 · May 4, 2026 · Updated 2 months ago
Heidelberg-NLP / CC-SHAP-VLM
Official code implementation for the paper "Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Expl…
☆12 · Updated this week
muharamdani / notebooklm-categorizer
NotebookLM Project Categorizer
☆28 · Nov 8, 2025 · Updated 8 months ago
mega002 / DocQN
Author implementation of "Learning to Search in Long Documents Using Document Structure" (Mor Geva and Jonathan Berant, 2018)
☆22 · Jul 12, 2018 · Updated 8 years ago
LUMIA-Group / HuRef
Official implementation for "HuRef: HUman-REadable Fingerprint for Large Language Models" (NeurIPS2024)
☆16 · Jun 17, 2025 · Updated last year
huashen218 / convxai
CSCW 2023 Best Demo Award: Conversational AI Explanations to Support Human-AI Scientific Writing
☆14 · Jun 25, 2023 · Updated 3 years ago
UKPLab / TWEAC-qa-agent-selection
☆20 · Apr 16, 2021 · Updated 5 years ago
gidim / Babler
Data Collection System For NLP/Speech Recognition
☆25 · Apr 20, 2021 · Updated 5 years ago
au-clan / cachesaver
☆30 · Feb 11, 2026 · Updated 5 months ago