juliancodaforno / CogBenchLinks
☆23Updated last year
Alternatives and similar repositories for CogBench
Users that are interested in CogBench are comparing it to the libraries listed below
Sorting:
- Largest, cross-domain data set of human behavior.☆85Updated 5 months ago
- ☆36Updated last year
- ☆135Updated last year
- ⚓️ Repository for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.☆101Updated 2 months ago
- Machine Theory of Mind Reading List. Built upon EMNLP Findings 2023 Paper: Towards A Holistic Landscape of Situated Theory of Mind in Lar…☆149Updated 10 months ago
- Interpretable text embeddings by asking LLMs yes/no questions (NeurIPS 2024)☆46Updated last year
- ☆112Updated 10 months ago
- Code for 'Emergent Analogical Reasoning in Large Language Models'☆51Updated last year
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …☆235Updated 2 weeks ago
- ☆16Updated 2 years ago
- Sparse Autoencoder for Mechanistic Interpretability☆285Updated last year
- ☆82Updated 3 months ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆132Updated 3 years ago
- Temporal Neural Networks☆21Updated 5 months ago
- Code repository for the paper "Mission: Impossible Language Models."☆56Updated 3 months ago
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆272Updated last week
- ☆227Updated last year
- TopoLM: brain-like spatio-functional organization in a topographic language model☆24Updated 7 months ago
- Menagerie of models trained on SAYCam (and more)☆26Updated last year
- Sparse Autoencoder Training Library☆56Updated 8 months ago
- Benchmarking Agentic LLM and VLM Reasoning On Games☆221Updated last month
- Governance of the Commons Simulation (GovSim)☆64Updated 11 months ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆100Updated 2 years ago
- ☆98Updated last year
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent☆101Updated 2 months ago
- ☆27Updated 3 weeks ago
- ☆193Updated last year
- Open source replication of Anthropic's Crosscoders for Model Diffing☆63Updated last year
- Sparse probing paper full code.☆66Updated 2 years ago
- ☆109Updated last year