PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models
☆12 · May 30, 2025 · Updated last year
Alternatives and similar repositories for PISCES
Users that are interested in PISCES are comparing it to the libraries listed below.
- ☆20 · Apr 26, 2026 · Updated last month
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals. ☆17 · Apr 25, 2021 · Updated 5 years ago
- Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021) ☆15 · Feb 24, 2026 · Updated 3 months ago
- ☆23 · Aug 30, 2025 · Updated 9 months ago
- Find informative examples to efficiently (human)-evaluate NLG models. ☆17 · Apr 22, 2026 · Updated last month
- ☆18 · Oct 6, 2022 · Updated 3 years ago
- ☆14 · Dec 22, 2025 · Updated 5 months ago
- [NeurIPS 2025 MechInterp Workshop - Spotlight] Official implementation of the paper "RelP: Faithful and Efficient Circuit Discovery in La… ☆29 · Nov 3, 2025 · Updated 7 months ago
- Attribute statements generated by LLMs to preceding tokens using attention weights. ☆26 · Apr 22, 2025 · Updated last year
- Saliency Cards are transparency documentation for saliency methods. Learn about new saliency methods or document your own! ☆19 · Jun 9, 2023 · Updated 3 years ago
- DecompX: Explaining Transformers Decisions by Propagating Token Decomposition [ACL 2023] ☆19 · Jul 3, 2025 · Updated 11 months ago
- ☆92 · May 7, 2026 · Updated last month
- [NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers ☆21 · May 16, 2023 · Updated 3 years ago
- Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021) ☆11 · Oct 25, 2021 · Updated 4 years ago
- ☆13 · Jul 26, 2023 · Updated 2 years ago
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/ ☆25 · Mar 10, 2025 · Updated last year
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations" ☆29 · Apr 28, 2023 · Updated 3 years ago
- Repository for "Training Language Models To Explain Their Own Computations" ☆22 · Dec 22, 2025 · Updated 5 months ago
- Code for ACL 2024 paper: PrivLM-Bench: A Multi-level Privacy Evaluation Benchmark for Language Models. ☆16 · Feb 5, 2025 · Updated last year
- VertMetric: An abstractive summarization evaluation package. VERT stands for Versatile Evaluation of Reduced Texts. ☆12 · Dec 20, 2018 · Updated 7 years ago
- HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checking ☆13 · Apr 11, 2025 · Updated last year
- ☆11 · Nov 13, 2021 · Updated 4 years ago
- Extension injector into Electron apps ☆23 · Aug 10, 2023 · Updated 2 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data" ☆40 · Dec 27, 2022 · Updated 3 years ago
- Measuring the Mixing of Contextual Information in the Transformer ☆35 · May 27, 2023 · Updated 3 years ago
- [CVPR 2025] Spectral State Space Model for Rotation-Invariant Visual Representation Learning ☆18 · Oct 13, 2025 · Updated 7 months ago
- Rationales for Sequential Predictions ☆40 · Mar 10, 2022 · Updated 4 years ago
- Dangerous Dave reverse engineering and level editing utility ☆26 · Oct 25, 2024 · Updated last year
- ☆54 · Oct 23, 2023 · Updated 2 years ago
- Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales" ☆16 · Aug 11, 2023 · Updated 2 years ago
- Official code implementation for the paper "Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Expl… ☆12 · Apr 4, 2025 · Updated last year
- [EMNLP 2025 Main] ConceptVectors Benchmark and Code for the paper "Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces" ☆40 · Aug 20, 2025 · Updated 9 months ago
- CSCW 2023 Best Demo Award: Conversational AI Explanations to Support Human-AI Scientific Writing ☆14 · Jun 25, 2023 · Updated 2 years ago
- ☆20 · Apr 16, 2021 · Updated 5 years ago
- Author implementation of "Learning to Search in Long Documents Using Document Structure" (Mor Geva and Jonathan Berant, 2018) ☆22 · Jul 12, 2018 · Updated 7 years ago
- Official implementation for "HuRef: HUman-REadable Fingerprint for Large Language Models" (NeurIPS2024) ☆16 · Jun 17, 2025 · Updated 11 months ago
- Data Collection System For NLP/Speech Recognition ☆25 · Apr 20, 2021 · Updated 5 years ago
- ☆19 · Oct 10, 2024 · Updated last year
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning… ☆22 · Nov 2, 2021 · Updated 4 years ago