ccs-ucb / llms-cogsciLinks
Psych 290Q S23 @ UC Berkeley: Large Language Models and Cognitive Science
☆18Updated last year
Alternatives and similar repositories for llms-cogsci
Users that are interested in llms-cogsci are comparing it to the libraries listed below
Sorting:
- ☆134Updated 7 months ago
- Evaluating the Moral Beliefs Encoded in LLMs☆26Updated 6 months ago
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆106Updated 7 months ago
- Code for procedurally synthesizing PHASE animations☆15Updated 3 years ago
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆224Updated last week
- An awesome reading list for intuitive physics raning from cognitive studies to computational studies.☆15Updated 10 months ago
- ☆207Updated last year
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆119Updated last year
- ☆95Updated last year
- Governance of the Commons Simulation (GovSim)☆52Updated 5 months ago
- A virtual environment for developing and evaluating automated scientific discovery agents.☆161Updated 3 months ago
- ☆20Updated 11 months ago
- ☆96Updated 11 months ago
- ☆69Updated last year
- General-Sum variant of the game Diplomacy for evaluating AIs.☆29Updated last year
- Learning to route instances for Human vs AI Feedback (ACL 2025 Main)☆23Updated last month
- Code for our NeurIPS'24 Dataset and Benchmark paper: Cooperation, Competition, and Maliciousness: LLM-Stakeholders Interactive Negotiatio…☆33Updated 7 months ago
- ☆25Updated last year
- Alignment with a millennium of moral progress. Spotlight@NeurIPS 2024 Track on Datasets and Benchmarks.☆22Updated 2 months ago
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind …☆31Updated 11 months ago
- The KiloGram Tangrams dataset☆55Updated 2 months ago
- Plurals: A System for Guiding LLMs Via Simulated Social Ensembles☆22Updated last week
- Repository for the paper Stream of Search: Learning to Search in Language☆148Updated 4 months ago
- ☆22Updated last year
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆55Updated last year
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆81Updated last year
- Code for "Using Large Language Models to Simulate Multiple Humans and Replicate Human Subject Studies"☆34Updated last year
- ICLR 2025 Workshop & CHI 2025 SIG: "Bidirectional Human-AI Alignment"☆37Updated 10 months ago
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …☆138Updated last year
- This repo contains code for our NeurIPS 2023 spotlight paper: Evaluating and Inducing Personality in Pre-trained Language Models☆52Updated last year