ronigold / TokenSHAP
TokenSHAP: Explain individual token importance in large language model prompts with SHAP values. Gain insights, debug models, detect biases, and enhance transparency effortlessly.
☆46 Updated 3 months ago
Alternatives and similar repositories for TokenSHAP
Users who are interested in TokenSHAP are comparing it to the libraries listed below.
- Toolkit for attaching, training, saving and loading of new heads for transformer models ☆282 Updated 4 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated last year
- Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning ☆46 Updated last year
- ☆145 Updated 11 months ago
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings. ☆163 Updated this week
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data ☆68 Updated last year
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data … ☆206 Updated this week
- A small library of LLM judges ☆232 Updated 3 weeks ago
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use… ☆123 Updated last week
- ☆118 Updated 10 months ago
- Generalist and Lightweight Model for Text Classification ☆139 Updated last month
- Code for training & evaluating Contextual Document Embedding models ☆194 Updated 2 months ago
- ☆101 Updated 5 months ago
- Notebooks for training universal 0-shot classifiers on many different tasks ☆131 Updated 6 months ago
- ☆48 Updated 5 months ago
- PyTorch library for Active Fine-Tuning ☆87 Updated 5 months ago
- ☆124 Updated 8 months ago
- EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other la… ☆59 Updated this week
- Let's build better datasets, together! ☆260 Updated 6 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper. ☆215 Updated last week
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆173 Updated 9 months ago
- ☆163 Updated last year
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆276 Updated last year
- Simple UI for debugging correlations of text embeddings ☆287 Updated last month
- ☆20 Updated last year
- ☆48 Updated 8 months ago
- Efficiently find the best-suited language model (LM) for your NLP task ☆123 Updated this week
- Attribute (or cite) statements generated by LLMs back to in-context information. ☆250 Updated 9 months ago
- ☆70 Updated this week
- Awesome synthetic (text) datasets ☆289 Updated last week