Find the samples, in the test data, on which your (generative) model makes mistakes.
☆29Oct 16, 2024Updated last year
Alternatives and similar repositories for infembed
Users that are interested in infembed are comparing it to the libraries listed below
Sorting:
- Code for NeurIPS'23 paper "A Bayesian Approach To Analysing Training Data Attribution In Deep Learning"☆17Jan 12, 2024Updated 2 years ago
- Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)☆37Jan 23, 2024Updated 2 years ago
- ☆32May 24, 2023Updated 2 years ago
- Repository for the D ONE MLOps AWS BlogPost☆11Aug 13, 2024Updated last year
- Debiasing Through Data Attribution☆12May 23, 2024Updated last year
- ☆10Oct 5, 2022Updated 3 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆39Dec 27, 2022Updated 3 years ago
- Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature☆184Jun 24, 2025Updated 8 months ago
- Graphical user interface for text-guided face editing☆11Jan 18, 2023Updated 3 years ago
- Tiny evaluation of leading LLMs on competitive programming problems☆14Nov 28, 2024Updated last year
- ACL24☆11Jun 7, 2024Updated last year
- A demo showing off daily-bots realtime voice and convex☆12Feb 5, 2026Updated 3 weeks ago
- Code and Data for GlitchBench☆13Feb 27, 2024Updated 2 years ago
- The Conceptual Coverage Across Languages Benchmark for Text-to-Image Models☆12Oct 28, 2024Updated last year
- ☆10Aug 7, 2023Updated 2 years ago
- Repo containing documentation and explanation for CSET's harm taxonomy of incidents from AIID.☆18Jun 21, 2024Updated last year
- ☆10Oct 20, 2023Updated 2 years ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Jan 2, 2026Updated 2 months ago
- ☆34Jan 25, 2026Updated last month
- ☆13Nov 22, 2024Updated last year
- ☆23Jun 5, 2025Updated 8 months ago
- An Empirical Study of Memorization in NLP (ACL 2022)☆13Jun 22, 2022Updated 3 years ago
- ☆12Oct 2, 2023Updated 2 years ago
- A teeny tiny set of ImageNet-like images for testing pipelines☆10Jan 31, 2018Updated 8 years ago
- Fine-tuning-free Shapley value (FreeShap) for instance attribution☆14May 29, 2024Updated last year
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆13Jan 9, 2024Updated 2 years ago
- ☆24Oct 2, 2025Updated 5 months ago
- ☆13Apr 13, 2025Updated 10 months ago
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data☆13Jul 21, 2024Updated last year
- ☆11Dec 15, 2024Updated last year
- Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"☆11Sep 20, 2024Updated last year
- Understanding Rare Spurious Correlations in Neural Network☆12Jun 5, 2022Updated 3 years ago
- AI Logging for Interpretability and Explainability🔬☆140Jun 7, 2024Updated last year
- Code repository for the CoRL 2021 paper "RoCUS: Robot Controller Understanding via Sampling"☆11Mar 24, 2022Updated 3 years ago
- Adversaial attack comparative assessment Large Language Model☆13May 21, 2025Updated 9 months ago
- moodist☆24Feb 20, 2026Updated last week
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- Code for the paper "Data Attribution for Text-to-Image Models by Unlearning Synthesized Images."☆17May 23, 2025Updated 9 months ago
- ☆13Dec 12, 2025Updated 2 months ago