Attribute statements generated by LLMs to preceding tokens using attention weights.
☆24 · Apr 22, 2025 · Updated last year
Alternatives and similar repositories for AT2
Users who are interested in AT2 are comparing it to the libraries listed below.
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/ ☆26 · Mar 10, 2025 · Updated last year
- PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models ☆12 · May 30, 2025 · Updated 11 months ago
- Ph.D. course on "Complex Network Analysis" @ Universitat Autònoma de Barcelona (2022) ☆15 · Apr 29, 2022 · Updated 4 years ago
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals. ☆17 · Apr 25, 2021 · Updated 5 years ago
- Minimal template for a Python library project ☆11 · Nov 21, 2022 · Updated 3 years ago
- Attribute (or cite) statements generated by LLMs back to in-context information. ☆332 · Oct 8, 2024 · Updated last year
- code of paper "Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM" ☆14 · Nov 17, 2023 · Updated 2 years ago
- Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021) ☆15 · Feb 24, 2026 · Updated 2 months ago
- Attributed Stream Hypergraph ☆16 · Nov 20, 2025 · Updated 5 months ago
- Have an AI debate against you on any topic of your choosing ☆15 · Oct 13, 2024 · Updated last year
- Custom graph/network/multi-weighted network class based on storing list of neighbors for each nodes (as opposed to edge list) for scalabl… ☆11 · Jan 18, 2024 · Updated 2 years ago
- ☆19 · Aug 30, 2025 · Updated 8 months ago
- ☆18 · Oct 6, 2022 · Updated 3 years ago
- (In progress) Network science laboratories. Covers graph theory, random graphs and ML on graphs ☆18 · Mar 4, 2025 · Updated last year
- [NeurIPS 2025 MechInterp Workshop - Spotlight] Official implementation of the paper "RelP: Faithful and Efficient Circuit Discovery in La… ☆27 · Nov 3, 2025 · Updated 5 months ago
- 🔥🔥 [NeurIPS2025] Exploring and mitigating semantic hallucinations in scene text perception and reasoning ☆28 · Dec 11, 2025 · Updated 4 months ago
- SSNMF models and multiplicative update methods. ☆10 · Sep 4, 2025 · Updated 7 months ago
- A Python library for the detection and visualization of emotions in texts ☆22 · Mar 21, 2025 · Updated last year
- Saliency Cards are transparency documentation for saliency methods. Learn about new saliency methods or document your own! ☆19 · Jun 9, 2023 · Updated 2 years ago
- DecompX: Explaining Transformers Decisions by Propagating Token Decomposition [ACL 2023] ☆19 · Jul 3, 2025 · Updated 9 months ago
- [NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers ☆21 · May 16, 2023 · Updated 2 years ago
- Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021) ☆11 · Oct 25, 2021 · Updated 4 years ago
- Exploratory notebooks for data science using Bluesky data ☆23 · Nov 14, 2024 · Updated last year
- ☆13 · Jul 26, 2023 · Updated 2 years ago
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations" ☆30 · Apr 28, 2023 · Updated 3 years ago
- Repository for "Training Language Models To Explain Their Own Computations" ☆22 · Dec 22, 2025 · Updated 4 months ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs ☆24 · Sep 21, 2025 · Updated 7 months ago
- Material for course "Geospatial Analytics" (GSA), master degree in Data Science and Business Analytics, University of Pisa ☆26 · Oct 25, 2024 · Updated last year
- Details about the wide minima density hypothesis and code to compute width of a minima ☆10 · Nov 30, 2024 · Updated last year
- [ACL'24] WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search Results with Citations ☆13 · Sep 11, 2024 · Updated last year
- HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checking ☆13 · Apr 11, 2025 · Updated last year
- Code for "Tracing Knowledge in Language Models Back to the Training Data" ☆39 · Dec 27, 2022 · Updated 3 years ago
- Measuring the Mixing of Contextual Information in the Transformer ☆34 · May 27, 2023 · Updated 2 years ago
- FactScoreLite is an implementation of the FactScore metric, designed for detailed accuracy assessment in text generation. This package bu… ☆13 · Apr 25, 2024 · Updated 2 years ago
- Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images ☆19 · Jun 4, 2025 · Updated 10 months ago
- Rationales for Sequential Predictions ☆40 · Mar 10, 2022 · Updated 4 years ago
- Pytorch implementation of FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks ☆16 · Aug 22, 2019 · Updated 6 years ago
- ☆51 · Oct 23, 2023 · Updated 2 years ago
- [EMNLP 2025 Main] ConceptVectors Benchmark and Code for the paper "Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces" ☆39 · Aug 20, 2025 · Updated 8 months ago