Leukas/CUTE

☆19

Alternatives and similar repositories for CUTE

Users who are interested in CUTE are comparing it to the libraries listed below.

RulinShao / RAG-evaluation-harnesses
An evaluation suite for Retrieval-Augmented Generation (RAG).
☆23 · Apr 26, 2025 · Updated 10 months ago
yoavgur / PISCES
🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models
☆12 · May 30, 2025 · Updated 9 months ago
NJUDeepEngine / CAEF
Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"
☆11 · Oct 11, 2024 · Updated last year
SciMT / SciMT-benchmark
☆11 · Jan 3, 2024 · Updated 2 years ago
flairNLP / familiarity
Label shift estimation for transfer difficulty with Familiarity.
☆10 · Feb 4, 2025 · Updated last year
philschmid / multilingual-serverless-qa-aws-lambda
☆10 · Dec 17, 2020 · Updated 5 years ago
Betswish / MIRAGE
Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/
☆26 · Mar 10, 2025 · Updated 11 months ago
arnab-api / romba
Applies ROME and MEMIT on Mamba-S4 models
☆14 · Apr 5, 2024 · Updated last year
HallerPatrick / pecc
[LREC-Coling 2024] PECC: Problem Extraction and Coding Challenges
☆14 · May 30, 2024 · Updated last year
lm-pub-quiz / lm-pub-quiz
Evaluate language models using multiple choice items
☆13 · Updated this week
technion-cs-nlp / parametric-faithfulness
☆17 · Aug 30, 2025 · Updated 6 months ago
jqueguiner / wav2vec2-sprint
docker for HF wav2vec2-sprint
☆13 · Mar 26, 2021 · Updated 4 years ago
GraphPKU / Case_or_Rule
exploring whether LLMs perform case-based or rule-based reasoning
☆30 · Mar 2, 2024 · Updated 2 years ago
Heidelberg-NLP / CC-SHAP-VLM
Official code implementation for the paper "Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Expl…
☆12 · Apr 4, 2025 · Updated 10 months ago
FarnoushRJ / RelP
[NeurIPS 2025 MechInterp Workshop - Spotlight] Official implementation of the paper "RelP: Faithful and Efficient Circuit Discovery in La…
☆27 · Nov 3, 2025 · Updated 3 months ago
dhfbk / KIND
KIND: an Italian Multi-Domain Dataset for Named Entity Recognition
☆15 · Jun 28, 2023 · Updated 2 years ago
zouharvi / subset2evaluate
Find informative examples to efficiently (human)-evaluate NLG models.
☆18 · Updated this week
MadryLab / AT2
Attribute statements generated by LLMs to preceding tokens using attention weights.
☆22 · Apr 22, 2025 · Updated 10 months ago
visinf / fast-axiomatic-attribution
Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021)
☆16 · Updated this week
wietsedv / xpos
Make the Best of Cross-lingual Transfer: Evidence from POS Tagging with over 100 Languages (ACL 2022)
☆19 · May 17, 2022 · Updated 3 years ago
xiye17 / EvalQAExpl
Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals.
☆18 · Apr 25, 2021 · Updated 4 years ago
bitextor / warc2text
Extracts plain text, language identification and more metadata from WARC records
☆23 · Oct 1, 2025 · Updated 5 months ago
mohsenfayyaz / DecompX
DecompX: Explaining Transformers Decisions by Propagating Token Decomposition [ACL 2023]
☆19 · Jul 3, 2025 · Updated 7 months ago
anthonywchen / MOCHA
Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".
☆16 · May 3, 2022 · Updated 3 years ago
mt-upc / transformer-contributions-nmt
☆18 · Oct 6, 2022 · Updated 3 years ago
ekinakyurek / influence
Code for "Tracing Knowledge in Language Models Back to the Training Data"
☆39 · Dec 27, 2022 · Updated 3 years ago
mitvis / saliency-cards
Saliency Cards are transparency documentation for saliency methods. Learn about new saliency methods or document your own!
☆19 · Jun 9, 2023 · Updated 2 years ago
alexa / ramen
A software for transferring pre-trained English models to foreign languages
☆19 · Mar 20, 2023 · Updated 2 years ago
NathanGodey / headless-lm
Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…
☆28 · Apr 17, 2024 · Updated last year
whyNLP / Probabilistic-Transformer
A probabilistic model for contextual word representation. Accepted to ACL2023 Findings.
☆25 · Oct 22, 2023 · Updated 2 years ago
mohsenfayyaz / GlobEnc
[NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers
☆21 · May 16, 2023 · Updated 2 years ago
huggingface / datasets-tagging
A Streamlit app to add structured tags to a dataset card
☆22 · Jun 30, 2022 · Updated 3 years ago
AV-Odyssey / AV-Odyssey
This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"
☆31 · Dec 23, 2024 · Updated last year
doerlbh / MiniVox
Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".
☆29 · Sep 20, 2021 · Updated 4 years ago
hrwise-nlp / Cue-CoT
Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs [EMNLP 2023 Findings]
☆24 · Nov 18, 2023 · Updated 2 years ago
crux82 / squad-it
A large scale dataset for Question Answering in Italian
☆27 · Nov 18, 2018 · Updated 7 years ago
KGQA / QALD_9_plus
QALD-9-Plus Dataset for Knowledge Graph Question Answering
☆29 · Jun 5, 2024 · Updated last year
SapienzaNLP / ita-bench
A collection of Italian benchmarks for LLM evaluation
☆37 · Dec 2, 2025 · Updated 2 months ago
tianyu-z / VCR
Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.
☆32 · Feb 26, 2025 · Updated last year