ccs-ucb / llms-cogsciLinks

Psych 290Q S23 @ UC Berkeley: Large Language Models and Cognitive Science

☆19

Alternatives and similar repositories for llms-cogsci

Users that are interested in llms-cogsci are comparing it to the libraries listed below

Sorting:

aypan17 / machiavelli
☆137Updated 2 weeks ago
acsresearch / interlab
☆20Updated last year
marcelbinz / CENTaUR
☆35Updated last year
sotopia-lab / sotopia
Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)
☆236Updated this week
MicroSTM / PHASE
Code for procedurally synthesizing PHASE animations
☆16Updated 3 years ago
understanding-search / maze-dataset
maze datasets for investigating OOD behavior of ML systems
☆51Updated this week
giorgiopiatti / GovSim
Governance of the Commons Simulation (GovSim)
☆56Updated 6 months ago
gabegrand / world-models
☆209Updated 2 years ago
likenneth / othello_world
Emergent world representations: Exploring a sequence model trained on a synthetic task
☆186Updated 2 years ago
allenai / discoveryworld
A virtual environment for developing and evaluating automated scientific discovery agents.
☆168Updated 5 months ago
marcelbinz / Psych-201
Largest, cross-domain data set of human behavior.
☆75Updated 3 weeks ago
mukobi / welfare-diplomacy
General-Sum variant of the game Diplomacy for evaluating AIs.
☆29Updated last year
balrog-ai / BALROG
Benchmarking Agentic LLM and VLM Reasoning On Games
☆179Updated 3 weeks ago
allenai / ScienceWorld
ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.
☆283Updated 3 weeks ago
meg-tong / sycophancy-eval
datasets from the paper "Towards Understanding Sycophancy in Language Models"
☆86Updated last year
google-deepmind / dangerous-capability-evaluations
☆55Updated last week
jlin816 / dialop
DialOp: Decision-oriented dialogue environments for collaborative language agents
☆109Updated 8 months ago
anthropics / evals
☆287Updated last year
minalee-research / coauthor-interface
☆95Updated last year
piantado / LOTlib3
Language of thought library for python 3
☆49Updated last year
EleutherAI / elk
Keeping language models honest by directly eliciting knowledge encoded in their activations.
☆209Updated this week
huashen218 / bidirectional-alignment-reading-list
ICLR 2025 Workshop & CHI 2025 SIG: "Bidirectional Human-AI Alignment"
☆38Updated last year
abdulhaim / LMRL-Gym
☆99Updated last year
TransluceAI / observatory
A toolkit for describing model features and intervening on those features to steer behavior.
☆195Updated 9 months ago
wesg52 / world-models
Extracting spatial and temporal world models from LLMs
☆255Updated last year
keyonvafa / world-model-evaluation
☆62Updated 8 months ago
flowersteam / lamorel
Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
☆237Updated 9 months ago
vinid / NegotiationArena
☆72Updated last year
victorvikram / ConceptARC
Materials for ConceptARC paper
☆98Updated 9 months ago
microsoft / SmartPlay
SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …
☆140Updated last year