πͺPISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models
β12May 30, 2025Updated 11 months ago
Alternatives and similar repositories for PISCES
Users that are interested in PISCES are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- β19Apr 26, 2026Updated 3 weeks ago
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.β17Apr 25, 2021Updated 5 years ago
- Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021)β15Feb 24, 2026Updated 2 months ago
- β22Aug 30, 2025Updated 8 months ago
- Find informative examples to efficiently (human)-evaluate NLG models.β18Apr 22, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- β18Oct 6, 2022Updated 3 years ago
- β14Dec 22, 2025Updated 4 months ago
- [NeurIPS 2025 MechInterp Workshop - Spotlight] Official implementation of the paper "RelP: Faithful and Efficient Circuit Discovery in Laβ¦β29Nov 3, 2025Updated 6 months ago
- Attribute statements generated by LLMs to preceding tokens using attention weights.β26Apr 22, 2025Updated last year
- β61May 7, 2026Updated last week
- Saliency Cards are transparency documentation for saliency methods. Learn about new saliency methods or document your own!β19Jun 9, 2023Updated 2 years ago
- DecompX: Explaining Transformers Decisions by Propagating Token Decomposition [ACL 2023]β19Jul 3, 2025Updated 10 months ago
- [NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformersβ21May 16, 2023Updated 3 years ago
- Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)β11Oct 25, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- β13Jul 26, 2023Updated 2 years ago
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/β26Mar 10, 2025Updated last year
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"β30Apr 28, 2023Updated 3 years ago
- Repository for "Training Language Models To Explain Their Own Computations"β22Dec 22, 2025Updated 4 months ago
- Code for ACL 2024 paper: PrivLM-Bench: A Multi-level Privacy Evaluation Benchmark for Language Models.β16Feb 5, 2025Updated last year
- VertMetric: An abstractive summarization evaluation package. VERT stands for Versatile Evaluation of Reduced Texts.β12Dec 20, 2018Updated 7 years ago
- HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checkingβ13Apr 11, 2025Updated last year
- β11Nov 13, 2021Updated 4 years ago
- Extension injector into Electron appsβ23Aug 10, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for "Tracing Knowledge in Language Models Back to the Training Data"