ronigold / TokenSHAP
TokenSHAP: Explain individual token importance in large language model prompts with SHAP values. Gain insights, debug models, detect biases, and enhance transparency effortlessly.
☆45 · Updated 2 months ago
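TokenSHAP's core idea is to treat prompt tokens as players in a cooperative game and estimate each token's Shapley value by Monte Carlo sampling. Below is a minimal sketch of that idea (a permutation-sampling Shapley estimator, not the library's actual API); `score_fn` is a hypothetical stand-in for whatever scores a partial prompt, e.g. similarity between the model's response to the subset and its response to the full prompt.

```python
# Minimal sketch of Monte Carlo Shapley estimation over prompt tokens.
# Not TokenSHAP's actual API; score_fn is a hypothetical value function.
import random
from typing import Callable, List

def shapley_token_importance(
    tokens: List[str],
    score_fn: Callable[[List[str]], float],
    n_samples: int = 200,
) -> List[float]:
    """Estimate each token's Shapley value via random permutations."""
    n = len(tokens)
    values = [0.0] * n
    for _ in range(n_samples):
        order = random.sample(range(n), n)  # random permutation of token indices
        included: List[int] = []
        prev = score_fn([])                 # baseline: score of the empty prompt
        for idx in order:
            included.append(idx)
            subset = [tokens[i] for i in sorted(included)]  # keep original token order
            cur = score_fn(subset)
            values[idx] += cur - prev       # marginal contribution of this token
            prev = cur
    return [v / n_samples for v in values]

# Toy usage: the "model" scores a subset by how many key tokens it contains.
toks = ["Explain", "quantum", "entanglement", "briefly"]
print(shapley_token_importance(
    toks, lambda s: float("quantum" in s) + float("entanglement" in s)))
```

In the toy example, "quantum" and "entanglement" receive importance near 1.0 and the other tokens near 0.0. The TokenSHAP paper, roughly, uses a value function based on the similarity between the model's response to the full prompt and its responses to token-ablated subsets.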
Alternatives and similar repositories for TokenSHAP
Users interested in TokenSHAP are comparing it to the libraries listed below.
- Top papers related to LLM-based agent evaluation ☆68 · Updated 2 weeks ago
- A mechanistic approach for understanding and detecting factual errors of large language models. ☆46 · Updated 11 months ago
- A repository containing the code for translating popular LLM benchmarks to German. ☆25 · Updated last year
- ☆58 · Updated 3 weeks ago
- Tensor Extraction of Latent Features (T-ELF). Within T-ELF's arsenal are non-negative matrix and tensor factorization solutions, equipped… ☆19 · Updated 3 weeks ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models" ☆108 · Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 · Updated 10 months ago
- A small library of LLM judges ☆205 · Updated 2 weeks ago
- Mixing Language Models with Self-Verification and Meta-Verification ☆104 · Updated 5 months ago
- Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning ☆46 · Updated last year
- PyTorch library for Active Fine-Tuning ☆79 · Updated 3 months ago
- The first dense retrieval model that can be prompted like an LM ☆73 · Updated 3 weeks ago
- ☆23 · Updated last month
- ☆69 · Updated last year
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods ☆92 · Updated this week
- Code for reproducing our paper "Not All Language Model Features Are Linear" ☆75 · Updated 6 months ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering ☆175 · Updated 3 months ago
- The Granite Guardian models are designed to detect risks in prompts and responses. ☆85 · Updated 2 months ago
- An introduction to LLM Sampling ☆78 · Updated 5 months ago
- Functional Benchmarks and the Reasoning Gap ☆86 · Updated 8 months ago
- Combining Base and Instruction-Tuned Language Models for Better Synthetic Data Generation ☆31 · Updated 3 months ago
- Erasing concepts from neural representations with provable guarantees ☆228 · Updated 4 months ago
- ☆45 · Updated 4 months ago
- Public code repo for the paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales" ☆106 · Updated 8 months ago
- Code for "On Measuring Faithfulness of Natural Language Explanations" ☆19 · Updated 10 months ago
- ☆27 · Updated last month
- Run safety benchmarks against AI models and view detailed reports showing how well they performed. ☆92 · Updated this week
- Pre-train Static Word Embeddings ☆76 · Updated this week
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆63 · Updated last year
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps" ☆126 · Updated 9 months ago