ccs-ucb / llms-cogsciLinks
Psych 290Q S23 @ UC Berkeley: Large Language Models and Cognitive Science
☆18Updated last year
Alternatives and similar repositories for llms-cogsci
Users that are interested in llms-cogsci are comparing it to the libraries listed below
Sorting:
- ☆137Updated 8 months ago
- ☆206Updated last year
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆229Updated 3 weeks ago
- Governance of the Commons Simulation (GovSim)☆55Updated 5 months ago
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.☆275Updated this week
- ☆20Updated last year
- Code for "Using Large Language Models to Simulate Multiple Humans and Replicate Human Subject Studies"☆34Updated last year
- General-Sum variant of the game Diplomacy for evaluating AIs.☆29Updated last year
- An awesome reading list for intuitive physics raning from cognitive studies to computational studies.☆15Updated 11 months ago
- A virtual environment for developing and evaluating automated scientific discovery agents.☆163Updated 4 months ago
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆118Updated last year
- ☆32Updated last year
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆109Updated 8 months ago
- ☆94Updated last year
- Emergent world representations: Exploring a sequence model trained on a synthetic task☆183Updated 2 years ago
- ICLR 2025 Workshop & CHI 2025 SIG: "Bidirectional Human-AI Alignment"☆37Updated 11 months ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆82Updated last year
- This repo contains code for our NeurIPS 2023 spotlight paper: Evaluating and Inducing Personality in Pre-trained Language Models☆52Updated last year
- ☆12Updated last year
- Evaluating the Moral Beliefs Encoded in LLMs☆26Updated 7 months ago
- Code for procedurally synthesizing PHASE animations☆16Updated 3 years ago
- ☆72Updated last year
- Data synthesis code for "AGENT: A Benchmark for Core Psychological Reasoning"☆22Updated 3 years ago
- Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL☆56Updated 9 months ago
- Materials for ConceptARC paper☆96Updated 8 months ago
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …☆140Updated last year
- Data and code for the paper "NormBank: A Knowledge Bank of Situational Social Norms"☆29Updated last year
- maze datasets for investigating OOD behavior of ML systems☆48Updated last month
- Interpreting how transformers simulate agents performing RL tasks☆87Updated last year
- Plurals: A System for Guiding LLMs Via Simulated Social Ensembles☆24Updated 3 weeks ago