microsoft/mechanistic-error-probe

A mechanistic approach for understanding and detecting factual errors of large language models.

☆50

Alternatives and similar repositories for mechanistic-error-probe

Users who are interested in mechanistic-error-probe are comparing it to the libraries listed below.

vinid / quica
quica is a tool to run inter-coder agreement pipelines in an easy and effective way. Multiple measures are run and results are collected…
☆23 · Nov 9, 2020 · Updated 5 years ago
vinid / NegotiationArena
☆84 · Mar 26, 2024 · Updated 2 years ago
vinid / cade
Compass-aligned Distributional Embeddings. Align embeddings from different corpora
☆43 · Dec 26, 2022 · Updated 3 years ago
KempnerInstitute / llm_uncertainty
Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"
☆11 · Jul 18, 2026 · Updated last week
lucidrains / self-reasoning-tokens-pytorch
Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto
☆57 · May 17, 2024 · Updated 2 years ago
ambuc / cornea
👁️ Isometric 3D Graphing / Rendering module for Haskell
☆15 · Sep 2, 2017 · Updated 8 years ago
JingXuTHU / Random-Masking-Finds-Winning-Tickets-for-Parameter-Efficient-Fine-tuning
☆14 · May 4, 2024 · Updated 2 years ago
outerbounds / metaflow-instruction-tuning
☆12 · Oct 25, 2023 · Updated 2 years ago
EngSalem / HaLo
☆16 · Sep 27, 2023 · Updated 2 years ago
nreimers / flax-sentence-embeddings
Shared code for training sentence embeddings with Flax / JAX
☆28 · Jul 15, 2021 · Updated 5 years ago
orensul / analogies_mining
☆21 · Mar 19, 2024 · Updated 2 years ago
wikifactcheck-english / wikifactcheck-english
Data and download script to accompany LREC2020 paper "Automated Fact-Checking of Claims from Wikipedia"
☆13 · Jul 19, 2023 · Updated 3 years ago
s-ball-10 / jailbreak_dynamics
☆25 · Jun 13, 2024 · Updated 2 years ago
graphml-lab-pwr / lapeigvals
Implementation of the paper "Hallucination Detection in LLMs Using Spectral Features of Attention Maps"
☆16 · Oct 18, 2025 · Updated 9 months ago
CRIPAC-DIG / SCGAN
[ICME 2019] Source code and datasets for "Semi-supervised Compatibility Learning Across Categories for Clothing Matching"
☆11 · Apr 26, 2024 · Updated 2 years ago
balevinstein / Probes
☆58 · Jun 30, 2023 · Updated 3 years ago
JoshEngels / SAE-Dark-Matter
Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders"
☆23 · Feb 6, 2025 · Updated last year
microsoft / BackwardCompatibilityML
Project for open sourcing research efforts on Backward Compatibility in Machine Learning
☆75 · Oct 3, 2023 · Updated 2 years ago
shmulvad / zero-for-ner
Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledge
☆17 · Nov 16, 2021 · Updated 4 years ago
Jaredk3nt / phoenix-padding
Simple phoenix setup for padded window management
☆13 · Apr 25, 2018 · Updated 8 years ago
kslav / cdr_mri
This is the code corresponding to our publication introducing ConvDecoder with physics-based regularization (CD+r) for MRI
☆10 · Feb 6, 2026 · Updated 5 months ago
julianje / Bishop
Mental state inference from observable behavior
☆15 · Dec 3, 2021 · Updated 4 years ago
SunbowLiu / PTvsBT
On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021)
☆13 · Nov 21, 2021 · Updated 4 years ago
neulab / ToM-Language-Acquisition
Code used to run experiments for the ICLR 2023 paper "Computational Language Acquisition with Theory of Mind".
☆15 · Apr 27, 2023 · Updated 3 years ago
edulinq / pacai
An AI educational project disguised as Pac-Man!
☆16 · Jul 15, 2026 · Updated 2 weeks ago
hasanar1f / HiRED
[AAAI 2025] HiRED strategically drops visual tokens in the image encoding stage to improve inference efficiency for High-Resolution Visio…
☆58 · Apr 18, 2025 · Updated last year
dat550-2021 / course-info
☆10 · Apr 14, 2021 · Updated 5 years ago
Aaquib111 / edge-attribution-patching
Code for my NeurIPS 2024 ATTRIB paper titled "Attribution Patching Outperforms Automated Circuit Discovery"
☆48 · May 31, 2024 · Updated 2 years ago
outerbounds / rag-demo
☆20 · Apr 12, 2024 · Updated 2 years ago
microsoft / eureka-ml-insights
A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.
☆185 · Jun 16, 2026 · Updated last month
tml-tuebingen / nshap
Python package to compute interaction indices that extend the Shapley Value. AISTATS 2023.
☆20 · Sep 25, 2023 · Updated 2 years ago
msclar / symmtom
Code for the paper "Symmetric Machine Theory of Mind", presented at ICML 2022.
☆12 · Jul 18, 2022 · Updated 4 years ago
yunshiuan / tomnet-project
This repo contains the ToMnet+ model for preference inference. Developed by Yun-Shiuan, Edwinn, Hsin-Yi, and Elaine.
☆10 · Feb 24, 2023 · Updated 3 years ago
cblearn / cblearn
Comparison-based Machine Learning in Python
☆21 · Jun 16, 2024 · Updated 2 years ago
hkust-nlp / Activation_Decoding
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)
☆64 · Mar 30, 2024 · Updated 2 years ago
cassidylaidlaw / hidden-context
Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"
☆35 · Dec 14, 2023 · Updated 2 years ago
Cadenza-Labs / sleeper-agents
☆15 · Jul 12, 2024 · Updated 2 years ago
allenai / noncompliance
This repository contains data, code and models for contextual noncompliance.
☆26 · Jul 18, 2024 · Updated 2 years ago
ryantibs / statlearn-s24
Course materials for Advanced Topics in Statistical Learning, Spring 2024
☆29 · Jul 14, 2025 · Updated last year