juliancodaforno / CogBenchLinks
☆23Updated last year
Alternatives and similar repositories for CogBench
Users that are interested in CogBench are comparing it to the libraries listed below
Sorting:
- Interpretable text embeddings by asking LLMs yes/no questions (NeurIPS 2024)☆46Updated last year
- Largest, cross-domain data set of human behavior.☆85Updated 5 months ago
- Machine Theory of Mind Reading List. Built upon EMNLP Findings 2023 Paper: Towards A Holistic Landscape of Situated Theory of Mind in Lar…☆147Updated 10 months ago
- Sparse Autoencoder for Mechanistic Interpretability☆286Updated last year
- ⚓️ Repository for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.☆95Updated last month
- ☆39Updated 9 months ago
- Temporal Neural Networks☆21Updated 4 months ago
- Implementation of the Tolman Eichenbaum Machine in pytorch☆168Updated 3 years ago
- ☆16Updated 2 years ago
- TopoLM: brain-like spatio-functional organization in a topographic language model☆23Updated 6 months ago
- ☆133Updated last year
- ☆82Updated 2 months ago
- Code for 'Emergent Analogical Reasoning in Large Language Models'☆51Updated last year
- maze datasets for investigating OOD behavior of ML systems☆67Updated last month
- Governance of the Commons Simulation (GovSim)☆62Updated 11 months ago
- ☆260Updated last year
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆134Updated last year
- Lu, Q., Hasson, U., & Norman, K. A. (2022). A neural network model of when to retrieve and encode episodic memories. eLife☆44Updated 3 years ago
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆267Updated this week
- ☆112Updated 10 months ago
- ☆21Updated last year
- ☆36Updated last year
- Menagerie of models trained on SAYCam (and more)☆25Updated last year
- ☆25Updated 2 months ago
- Meta-Learning for Compositionality (MLC) for modeling human behavior☆145Updated last month
- ☆89Updated 8 months ago
- Universal Neurons in GPT2 Language Models☆31Updated last year
- A framework for evaluating models on their alignment to brain and behavioral measurements (100+ benchmarks)☆164Updated this week
- Code repository for the paper "Mission: Impossible Language Models."☆56Updated 2 months ago
- ☆30Updated 2 years ago