aryamanarora/causalgym

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aryamanarora/causalgym)

aryamanarora / causalgym

CausalGym: Benchmarking causal interpretability methods on linguistic tasks

☆54

Alternatives and similar repositories for causalgym

Users that are interested in causalgym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kanishkamisra / wugs-and-daxes
View on GitHub
Collection of academic works in natural language processing, computational linguistics, and computational cognitive science that study th…
☆22Mar 20, 2024Updated 2 years ago
UIUCLearningLanguageLab / AOCHILDES
View on GitHub
Python API for loading language data from American-English CHILDES database
☆18Aug 14, 2022Updated 3 years ago
tommccoy1 / inductive-bias-distillation
View on GitHub
☆22Apr 5, 2026Updated 3 months ago
jennhu / lm-pragmatics
View on GitHub
Code and data for "A fine-grained comparison of pragmatic language understanding in humans and language models"
☆11Dec 14, 2022Updated 3 years ago
am-bean / lingOly
View on GitHub
A benchmark for language models based on the UK Linguistics Olympiad
☆12Mar 3, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
cpllab / syntactic-generalization
View on GitHub
Code and data for "A Systematic Assessment of Syntactic Generalization in Neural Language Models"
☆31Jun 18, 2021Updated 5 years ago
ellisk42 / bpl_phonology
View on GitHub
☆16May 24, 2022Updated 4 years ago
apartresearch / specificityplus
View on GitHub
👩‍💻 Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark"
☆20Jan 19, 2024Updated 2 years ago
stanfordnlp / pyvene
View on GitHub
Stanford NLP Python library for understanding and improving PyTorch models via interventions
☆893Mar 6, 2026Updated 4 months ago
tilde-research / activault
View on GitHub
Engine for collecting, uploading, and downloading model activations
☆30Apr 2, 2025Updated last year
huhailinguist / ArguGPT
View on GitHub
☆22Sep 25, 2023Updated 2 years ago
kanishkamisra / minicons
View on GitHub
Utility for behavioral and representational analyses of Language Models
☆193Updated this week
MilaNLProc / language-invariant-properties
View on GitHub
☆22Mar 31, 2022Updated 4 years ago
EleutherAI / mdl
View on GitHub
Minimum Description Length probing for neural network representations
☆20Jan 28, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
ivaxi0s / CausalGraph2LLM
View on GitHub
[NAACL'25] Evaluating LLMs for Causal Queries
☆14Feb 18, 2025Updated last year
ApolloResearch / e2e_sae
View on GitHub
Sparse Autoencoder Training Library
☆58May 1, 2025Updated last year
samiraabnar / Bridge
View on GitHub
Making a bridge between NLP models and Brain data
☆19Jun 3, 2020Updated 6 years ago
koayon / atp_star
View on GitHub
PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)
☆20Jan 19, 2025Updated last year
fdalvi / NeuroX
View on GitHub
A Python library that encapsulates various methods for neuron interpretation and analysis in Deep NLP models.
☆110Oct 4, 2023Updated 2 years ago
EleutherAI / steering-llama3
View on GitHub
☆30Aug 2, 2024Updated last year
mmarius / montreal-things-to-do
View on GitHub
A list of things to do in Montréal.
☆28Oct 6, 2025Updated 9 months ago
yikee / Knowledge_Conflict
View on GitHub
Resolving Knowledge Conflicts in Large Language Models, COLM 2024
☆18Oct 7, 2025Updated 9 months ago
saprmarks / feature-circuits
View on GitHub
☆223Oct 14, 2025Updated 9 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
neelnanda-io / 1L-Sparse-Autoencoder
View on GitHub
☆141Oct 28, 2023Updated 2 years ago
ckkissane / sae-transfer
View on GitHub
Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"
☆13Jul 18, 2024Updated 2 years ago
HoagyC / sparse_coding
View on GitHub
Using sparse coding to find distributed representations used by neural networks.
☆307Nov 10, 2023Updated 2 years ago
technion-cs-nlp / bias-probing
View on GitHub
Debiasing Methods in Natural Language Understanding Make Bias More Accessible: Code and Data
☆14Apr 24, 2022Updated 4 years ago
babylm / evaluation-pipeline-2025
View on GitHub
☆26Aug 19, 2025Updated 11 months ago
XuchanBao / behavioral-self-awareness
View on GitHub
☆37Feb 20, 2025Updated last year
ZurichNLP / ContraWSD
View on GitHub
Word sense disambiguation test sets for NMT
☆21Dec 3, 2020Updated 5 years ago
MingyuJ666 / LVLM-Safety
View on GitHub
[FCS'24] LVLM Safety paper
☆19Jan 4, 2025Updated last year
iesl / s-diora
View on GitHub
☆12Jan 29, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
melsherief / hate_speech_icwsm18
View on GitHub
☆15Apr 10, 2018Updated 8 years ago
wenlai-lavine / m4Adapter
View on GitHub
m4Adapter: Multilingual Multi-Domain Adaptation for Machine Translation with a Meta-Adapter (EMNLP 2022)
☆19Mar 28, 2023Updated 3 years ago
babylm / evaluation-pipeline-2024
View on GitHub
The evaluation pipeline for the 2024 BabyLM Challenge.
☆34Nov 13, 2024Updated last year
mayhewsw / multilingual-data-stats
View on GitHub
Statistics on multilingual datasets
☆17Jul 12, 2022Updated 4 years ago
facebookresearch / coocmap
View on GitHub
code for paper "Accessing higher dimensions for unsupervised word translation"
☆23Jun 26, 2023Updated 3 years ago
simondschweitzer / aes
View on GitHub
AES - Ancient Egyptian Sentences; Corpus of Ancient Egyptian sentences for corpus-linguistic research
☆11May 18, 2021Updated 5 years ago
stanfordnlp / axbench
View on GitHub
Stanford NLP Python library for benchmarking the utility of LLM interpretability methods
☆210Mar 12, 2026Updated 4 months ago