balevinstein/Probes

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/balevinstein/Probes)

balevinstein / Probes

☆58

Alternatives and similar repositories for Probes

Users that are interested in Probes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

eth-sri / ChatProtect
View on GitHub
This is the code for the paper "Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and Mitigation".
☆38Apr 15, 2026Updated 3 months ago
orhonovich / q-squared
View on GitHub
☆30Sep 5, 2021Updated 4 years ago
KempnerInstitute / llm_uncertainty
View on GitHub
Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"
☆11Updated this week
rmin2000 / adv_tracing
View on GitHub
Identification of the Adversary from a Single Adversarial Example (ICML 2023)
☆10Jul 15, 2024Updated 2 years ago
likenneth / honest_llama
View on GitHub
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
☆581Jan 28, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
oneal2000 / MIND
View on GitHub
Source code of our paper MIND, ACL 2024 Long Paper
☆65Nov 14, 2025Updated 8 months ago
graphml-lab-pwr / lapeigvals
View on GitHub
Implementation of the paper "Hallucination Detection in LLMs Using Spectral Features of Attention Maps"
☆16Oct 18, 2025Updated 9 months ago
JiaQiSJTU / FaithEval-FFLM
View on GitHub
A zero-shot faithfulness evaluation metric for text summarization
☆11Oct 17, 2023Updated 2 years ago
hkust-nlp / felm
View on GitHub
Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆65Dec 25, 2023Updated 2 years ago
nusnlp / FSPO
View on GitHub
Official code for our paper "Reasoning Models Hallucinate More: Factuality-Aware Reinforcement Learning for Large Reasoning Models"
☆26Oct 31, 2025Updated 8 months ago
bryanchrist / MathNeuro
View on GitHub
Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes
☆23Jun 15, 2025Updated last year
EdinburghNLP / awesome-hallucination-detection
View on GitHub
List of papers on hallucination detection in LLMs.
☆1,120Jun 6, 2026Updated last month
D2I-ai / eigenscore
View on GitHub
☆46Dec 9, 2024Updated last year
thestephencasper / latent_adversarial_training
View on GitHub
☆24Jul 25, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
lorenzkuhn / semantic_uncertainty
View on GitHub
☆186Jun 20, 2024Updated 2 years ago
yinzhangyue / SelfAware
View on GitHub
Do Large Language Models Know What They Don’t Know?
☆103Nov 8, 2024Updated last year
shmsw25 / FActScore
View on GitHub
A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…
☆449Apr 13, 2025Updated last year
terarachang / DataICL
View on GitHub
Data Valuation on In-Context Examples (ACL23)
☆24Jan 12, 2025Updated last year
jsrozner / decrypt
View on GitHub
Repository for paper Decrypting Cryptic Crosswords
☆11Jan 15, 2022Updated 4 years ago
RUCAIBox / HaluEval
View on GitHub
This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.
☆592Feb 12, 2024Updated 2 years ago
yaronbeen / YTDemos
View on GitHub
☆18Jan 9, 2024Updated 2 years ago
GAIR-NLP / BeHonest
View on GitHub
BeHonest: Benchmarking Honesty in Large Language Models
☆35Aug 15, 2024Updated last year
chrisnager / chrisnager-dot-com
View on GitHub
chrisnager.com
☆12Jul 12, 2026Updated last week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
MSR-LIT / MultilingualBias
View on GitHub
☆10Jul 6, 2023Updated 3 years ago
mukhal / GRACE
View on GitHub
[EMNLP '23] Discriminator-Guided Chain-of-Thought Reasoning
☆50Oct 11, 2024Updated last year
kite99520 / DialSummEval
View on GitHub
Resources for paper "DialSummEval: Revisiting summarization evaluation for dialogues"
☆14Jul 22, 2025Updated 11 months ago
EleutherAI / elk
View on GitHub
Keeping language models honest by directly eliciting knowledge encoded in their activations.
☆221Updated this week
nyu-mll / BBQ
View on GitHub
Repository for the Bias Benchmark for QA dataset.
☆146Jan 8, 2024Updated 2 years ago
tatsu-lab / linguistic_calibration
View on GitHub
Align your LM to express calibrated verbal statements of confidence in its long-form generations.
☆29Jun 4, 2024Updated 2 years ago
da03 / criticize_text_generation
View on GitHub
A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …
☆12Mar 18, 2023Updated 3 years ago
microsoft / mechanistic-error-probe
View on GitHub
A mechanistic approach for understanding and detecting factual errors of large language models.
☆50Jul 6, 2024Updated 2 years ago
jiaqima / G3NN
View on GitHub
A Flexible Generative Framework for Graph-based Semi-supervised Learning (NeurIPS 2019)
☆16Nov 14, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
gokulp01 / ComTraq-MPC
View on GitHub
[IROS 2024] "ComTraQ-MPC: Meta-Trained DQN-MPC Integration for Trajectory Tracking with Limited Active Localization Updates" by Gokul Put…
☆13Apr 10, 2025Updated last year
HKUST-KnowComp / SP-10K
View on GitHub
SP-10K is a large-scale human-annotated selectional preference set. Five selectional preference relations are included.
☆12May 6, 2020Updated 6 years ago
genglinliu / UnknownBench
View on GitHub
Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge
☆14Feb 20, 2024Updated 2 years ago
LoryPack / LLM-LieDetector
View on GitHub
Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"
☆74Jun 19, 2024Updated 2 years ago
THU-KEG / KoLA
View on GitHub
[ICLR24] The open-source repo of THU-KEG's KoLA benchmark.
☆57Sep 28, 2023Updated 2 years ago
wzhouad / context-faithful-llm
View on GitHub
Code and data for paper "Context-faithful Prompting for Large Language Models".
☆41Mar 23, 2023Updated 3 years ago
hwanheelee1993 / KPQA
View on GitHub
KPQA is an evaluation metric for generative question answering. (NAACL-21)
☆33Aug 3, 2021Updated 4 years ago