πͺPISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models
β12May 30, 2025Updated 11 months ago
Alternatives and similar repositories for PISCES
Users that are interested in PISCES are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- β19Sep 16, 2025Updated 7 months ago
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.β17Apr 25, 2021Updated 5 years ago
- Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021)β15Feb 24, 2026Updated 2 months ago
- β19Aug 30, 2025Updated 8 months ago
- Find informative examples to efficiently (human)-evaluate NLG models.β18Apr 22, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- β18Oct 6, 2022Updated 3 years ago
- β14Dec 22, 2025Updated 4 months ago
- Attribute statements generated by LLMs to preceding tokens using attention weights.β24Apr 22, 2025Updated last year
- β35Feb 15, 2026Updated 2 months ago
- [NeurIPS 2025 MechInterp Workshop - Spotlight] Official implementation of the paper "RelP: Faithful and Efficient Circuit Discovery in Laβ¦β27Nov 3, 2025Updated 5 months ago
- Saliency Cards are transparency documentation for saliency methods. Learn about new saliency methods or document your own!β19Jun 9, 2023Updated 2 years ago
- DecompX: Explaining Transformers Decisions by Propagating Token Decomposition [ACL 2023]β19Jul 3, 2025Updated 9 months ago
- [NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformersβ21May 16, 2023Updated 2 years ago
- Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)β11Oct 25, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- β13Jul 26, 2023Updated 2 years ago
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/β26Mar 10, 2025Updated last year
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"β30Apr 28, 2023Updated 3 years ago
- Repository for "Training Language Models To Explain Their Own Computations"β22Dec 22, 2025Updated 4 months ago
- Code for ACL 2024 paper: PrivLM-Bench: A Multi-level Privacy Evaluation Benchmark for Language Models.β16Feb 5, 2025Updated last year
- VertMetric: An abstractive summarization evaluation package. VERT stands for Versatile Evaluation of Reduced Texts.β11Dec 20, 2018Updated 7 years ago
- HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checkingβ13Apr 11, 2025Updated last year
- β11Nov 13, 2021Updated 4 years ago
- Extension injector into Electron appsβ23Aug 10, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code for "Tracing Knowledge in Language Models Back to the Training Data"β39Dec 27, 2022Updated 3 years ago
- Measuring the Mixing of Contextual Information in the Transformerβ34May 27, 2023Updated 2 years ago
- [CVPR 2025] Spectral State Space Model for Rotation-Invariant Visual Representation Learningβ18Oct 13, 2025Updated 6 months ago
- Rationales for Sequential Predictionsβ40Mar 10, 2022Updated 4 years ago
- Dangerous Dave reverse engineering and level editing utilityβ24Oct 25, 2024Updated last year
- β51Oct 23, 2023Updated 2 years ago
- Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales"β16Aug 11, 2023Updated 2 years ago
- [EMNLP 2025 Main] ConceptVectors Benchmark and Code for the paper "Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces"β39Aug 20, 2025Updated 8 months ago
- Official code implementation for the paper "Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Explβ¦β12Apr 4, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- CSCW 2023 Best Demo Award: Conversational AI Explanations to Support Human-AI Scientific Writingβ14Jun 25, 2023Updated 2 years ago
- β18Apr 16, 2021Updated 5 years ago
- Author implementation of "Learning to Search in Long Documents Using Document Structure" (Mor Geva and Jonathan Berant, 2018)β22Jul 12, 2018Updated 7 years ago
- Official implementation for "HuRef: HUman-REadable Fingerprint for Large Language Models" (NeurIPS2024)β15Jun 17, 2025Updated 10 months ago
- Data Collection System For NLP/Speech Recognitionβ25Apr 20, 2021Updated 5 years ago
- β19Oct 10, 2024Updated last year
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Caβ¦β61May 9, 2023Updated 2 years ago