πͺPISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models
β12May 30, 2025Updated 10 months ago
Alternatives and similar repositories for PISCES
Users that are interested in PISCES are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- β19Sep 16, 2025Updated 6 months ago
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.β17Apr 25, 2021Updated 4 years ago
- β17Aug 30, 2025Updated 7 months ago
- Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021)β15Feb 24, 2026Updated last month
- Find informative examples to efficiently (human)-evaluate NLG models.β18Feb 27, 2026Updated last month
- End-to-end encrypted email - Proton Mail β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- β18Oct 6, 2022Updated 3 years ago
- Attribute statements generated by LLMs to preceding tokens using attention weights.β24Apr 22, 2025Updated 11 months ago
- β33Feb 15, 2026Updated last month
- [NeurIPS 2025 MechInterp Workshop - Spotlight] Official implementation of the paper "RelP: Faithful and Efficient Circuit Discovery in Laβ¦β27Nov 3, 2025Updated 5 months ago
- Saliency Cards are transparency documentation for saliency methods. Learn about new saliency methods or document your own!β19Jun 9, 2023Updated 2 years ago
- DecompX: Explaining Transformers Decisions by Propagating Token Decomposition [ACL 2023]β19Jul 3, 2025Updated 9 months ago
- [NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformersβ21May 16, 2023Updated 2 years ago
- Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)β11Oct 25, 2021Updated 4 years ago
- β13Jul 26, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/β26Mar 10, 2025Updated last year
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"β30Apr 28, 2023Updated 2 years ago
- Repository for "Training Language Models To Explain Their Own Computations"β21Dec 22, 2025Updated 3 months ago
- Code for ACL 2024 paper: PrivLM-Bench: A Multi-level Privacy Evaluation Benchmark for Language Models.β16Feb 5, 2025Updated last year
- VertMetric: An abstractive summarization evaluation package. VERT stands for Versatile Evaluation of Reduced Texts.β11Dec 20, 2018Updated 7 years ago
- HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checkingβ13Apr 11, 2025Updated 11 months ago
- β11Nov 13, 2021Updated 4 years ago
- Extension injector into Electron appsβ23Aug 10, 2023Updated 2 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data"β39Dec 27, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Measuring the Mixing of Contextual Information in the Transformerβ34May 27, 2023Updated 2 years ago
- [CVPR 2025] Spectral State Space Model for Rotation-Invariant Visual Representation Learningβ18Oct 13, 2025Updated 5 months ago
- Rationales for Sequential Predictionsβ40Mar 10, 2022Updated 4 years ago
- Dangerous Dave reverse engineering and level editing utilityβ23Oct 25, 2024Updated last year
- β52Oct 23, 2023Updated 2 years ago
- [EMNLP 2025 Main] ConceptVectors Benchmark and Code for the paper "Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces"β39Aug 20, 2025Updated 7 months ago
- Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales"β16Aug 11, 2023Updated 2 years ago
- Official code implementation for the paper "Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Explβ¦β12Apr 4, 2025Updated last year
- CSCW 2023 Best Demo Award: Conversational AI Explanations to Support Human-AI Scientific Writingβ14Jun 25, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- β18Apr 16, 2021Updated 4 years ago
- Author implementation of "Learning to Search in Long Documents Using Document Structure" (Mor Geva and Jonathan Berant, 2018)β22Jul 12, 2018Updated 7 years ago
- Official implementation for "HuRef: HUman-REadable Fingerprint for Large Language Models" (NeurIPS2024)β15Jun 17, 2025Updated 9 months ago
- Data Collection System For NLP/Speech Recognitionβ25Apr 20, 2021Updated 4 years ago
- β19Oct 10, 2024Updated last year
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Caβ¦β61May 9, 2023Updated 2 years ago
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoningβ¦β22Nov 2, 2021Updated 4 years ago