juliancodaforno / CogBenchLinks
☆23Updated last year
Alternatives and similar repositories for CogBench
Users that are interested in CogBench are comparing it to the libraries listed below
Sorting:
- Largest, cross-domain data set of human behavior.☆86Updated 6 months ago
- Machine Theory of Mind Reading List. Built upon EMNLP Findings 2023 Paper: Towards A Holistic Landscape of Situated Theory of Mind in Lar…☆149Updated 11 months ago
- ⚓️ Repository for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.☆108Updated 3 months ago
- Interpretable text embeddings by asking LLMs yes/no questions (NeurIPS 2024)☆46Updated last year
- ☆137Updated last year
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆277Updated last week
- Sparse Autoencoder for Mechanistic Interpretability☆289Updated last year
- ☆114Updated 11 months ago
- ☆36Updated last year
- ☆16Updated 2 years ago
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …☆238Updated this week
- Governance of the Commons Simulation (GovSim)☆64Updated last year
- Temporal Neural Networks☆22Updated 2 weeks ago
- This repository collects all relevant resources about interpretability in LLMs☆390Updated last year
- ☆29Updated last week
- ☆265Updated last year
- ☆40Updated 10 months ago
- Using sparse coding to find distributed representations used by neural networks.☆293Updated 2 years ago
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆163Updated 7 months ago
- ☆53Updated 9 months ago
- Code repository for the paper "Mission: Impossible Language Models."☆56Updated 4 months ago
- TopoLM: brain-like spatio-functional organization in a topographic language model☆25Updated 8 months ago
- Extracting spatial and temporal world models from LLMs☆257Updated 2 years ago
- Code repo for the model organisms and convergent directions of EM papers.☆45Updated 4 months ago
- ☆195Updated last year
- maze datasets for investigating OOD behavior of ML systems☆70Updated last week
- ☆30Updated 2 years ago
- ☆25Updated 4 months ago
- Menagerie of models trained on SAYCam (and more)☆27Updated last year
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆137Updated last year