CausalGym: Benchmarking causal interpretability methods on linguistic tasks
☆53Nov 30, 2024Updated last year
Alternatives and similar repositories for causalgym
Users that are interested in causalgym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python API for loading language data from American-English CHILDES database☆18Aug 14, 2022Updated 3 years ago
- Code and data for "A fine-grained comparison of pragmatic language understanding in humans and language models"☆11Dec 14, 2022Updated 3 years ago
- A benchmark for language models based on the UK Linguistics Olympiad☆12Mar 3, 2025Updated last year
- [Kauf & Ivanova, ACL 2023] A Better Way to Do Masked Language Model Scoring☆12Dec 1, 2023Updated 2 years ago
- ☆22Apr 5, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code and data for "A Systematic Assessment of Syntactic Generalization in Neural Language Models"☆30Jun 18, 2021Updated 5 years ago
- ☆15May 24, 2022Updated 4 years ago
- Stanford NLP Python library for understanding and improving PyTorch models via interventions☆883Mar 6, 2026Updated 3 months ago
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆198Mar 12, 2026Updated 3 months ago
- ☆22Sep 25, 2023Updated 2 years ago
- Utility for behavioral and representational analyses of Language Models☆187Updated this week
- ☆22Mar 31, 2022Updated 4 years ago
- [NAACL'25] Evaluating LLMs for Causal Queries☆14Feb 18, 2025Updated last year
- Minimum Description Length probing for neural network representations☆20Jan 28, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Sparse Autoencoder Training Library☆57May 1, 2025Updated last year
- ☆95May 7, 2026Updated last month
- Making a bridge between NLP models and Brain data☆19Jun 3, 2020Updated 6 years ago
- ☆219Oct 14, 2025Updated 8 months ago
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)☆20Jan 19, 2025Updated last year
- Resolving Knowledge Conflicts in Large Language Models, COLM 2024☆18Oct 7, 2025Updated 8 months ago
- ☆136Oct 28, 2023Updated 2 years ago
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- Using sparse coding to find distributed representations used by neural networks.☆305Nov 10, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- The CausalPlayground library serves as a tool for causality research, focusing on the interactive exploration of structural causal models…☆18Jun 5, 2024Updated 2 years ago
- ☆422Aug 21, 2025Updated 9 months ago
- Debiasing Methods in Natural Language Understanding Make Bias More Accessible: Code and Data☆14Apr 24, 2022Updated 4 years ago
- [ICLR 2025] ChroKnowledge: Unveiling Chronological Knowledge of Language Models in Multiple Domains☆17Mar 4, 2025Updated last year
- Word sense disambiguation test sets for NMT☆21Dec 3, 2020Updated 5 years ago
- ☆22May 7, 2025Updated last year
- CAMeL Dataset☆15Apr 15, 2025Updated last year
- AI-ready open dataset of e-commerce coupons, deals & redeem-links curated by Kindred☆18May 2, 2025Updated last year
- Code for the paper "Large Language Models Share Representations of Latent Grammatical Concepts Across Typologically Diverse Languages" (N…☆17Apr 13, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆15Apr 10, 2018Updated 8 years ago
- m4Adapter: Multilingual Multi-Domain Adaptation for Machine Translation with a Meta-Adapter (EMNLP 2022)☆19Mar 28, 2023Updated 3 years ago
- ☆48Jan 3, 2026Updated 5 months ago
- The evaluation pipeline for the 2024 BabyLM Challenge.☆34Nov 13, 2024Updated last year
- Statistics on multilingual datasets☆17Jul 12, 2022Updated 3 years ago
- Code for "Evaluating Explainable AI: Which Algorithmic Explanations Help Users Predict Model Behavior?"☆46Jan 17, 2024Updated 2 years ago
- ☆56Oct 23, 2023Updated 2 years ago