juliancodaforno / CogBench
☆23 Updated last year
Alternatives and similar repositories for CogBench
Users who are interested in CogBench are comparing it to the libraries listed below
- Largest, cross-domain data set of human behavior. ☆85 Updated 6 months ago
- Menagerie of models trained on SAYCam (and more) ☆26 Updated last year
- TopoLM: brain-like spatio-functional organization in a topographic language model ☆24 Updated 7 months ago
- Machine Theory of Mind Reading List. Built upon EMNLP Findings 2023 Paper: Towards A Holistic Landscape of Situated Theory of Mind in Lar… ☆149 Updated 10 months ago
- ⚓️ Repository for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper. ☆104 Updated 2 months ago
- Implementation of the Tolman Eichenbaum Machine in pytorch ☆168 Updated 3 years ago
- ☆36 Updated last year
- Tools for exploring Transformer neuron behaviour, including input pruning and diversification. ☆23 Updated 2 years ago
- ☆25 Updated 3 months ago
- ☆135 Updated last year
- ☆39 Updated 9 months ago
- Interpretable text embeddings by asking LLMs yes/no questions (NeurIPS 2024) ☆46 Updated last year
- Temporal Neural Networks ☆21 Updated 5 months ago
- maze datasets for investigating OOD behavior of ML systems ☆70 Updated 2 months ago
- Code for 'Emergent Analogical Reasoning in Large Language Models' ☆51 Updated last year
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight) ☆272 Updated this week
- Governance of the Commons Simulation (GovSim) ☆64 Updated 11 months ago
- ☆16 Updated 2 years ago
- This repository collects all relevant resources about interpretability in LLMs ☆389 Updated last year
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind … ☆38 Updated last year
- Code repository for the paper "Mission: Impossible Language Models." ☆56 Updated 3 months ago
- Sparse Autoencoder for Mechanistic Interpretability ☆285 Updated last year
- ☆21 Updated last year
- Benchmarking Agentic LLM and VLM Reasoning On Games ☆221 Updated last month
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs. ☆136 Updated last year
- NiceWebRL is a Python library for quickly making human subject experiments that leverage machine reinforcement learning environments. ☆76 Updated last month
- Universal Neurons in GPT2 Language Models ☆31 Updated last year
- datasets from the paper "Towards Understanding Sycophancy in Language Models" ☆100 Updated 2 years ago
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent ☆101 Updated 2 months ago
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models … ☆235 Updated 2 weeks ago