marcelbinz / CENTaURLinks
☆35Updated last year
Alternatives and similar repositories for CENTaUR
Users that are interested in CENTaUR are comparing it to the libraries listed below
Sorting:
- Largest, cross-domain data set of human behavior.☆81Updated 3 months ago
- ☆177Updated 3 months ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆240Updated last month
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆128Updated last year
- A virtual environment for developing and evaluating automated scientific discovery agents.☆188Updated 7 months ago
- ☆77Updated last year
- ☆137Updated 2 months ago
- ☆70Updated last year
- ☆21Updated last year
- ☆69Updated 3 years ago
- Governance of the Commons Simulation (GovSim)☆59Updated 8 months ago
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.☆296Updated 3 months ago
- large population models☆431Updated last week
- Benchmarking Agentic LLM and VLM Reasoning On Games☆201Updated last month
- Repository for the paper Stream of Search: Learning to Search in Language☆151Updated 8 months ago
- Psych 290Q S23 @ UC Berkeley: Large Language Models and Cognitive Science☆21Updated last year
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆250Updated last week
- Automated Research Assistant☆64Updated this week
- How to create rational LLM-based agents? Using game-theoretic workflows!☆78Updated 4 months ago
- Extracting spatial and temporal world models from LLMs☆257Updated 2 years ago
- This repository contains a LLM benchmark for the social deduction game `Resistance Avalon'☆128Updated 4 months ago
- Awesome Open-ended AI☆348Updated 2 months ago
- ☆219Updated 2 years ago
- ☆103Updated last year
- ☆128Updated last year
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning☆286Updated last week
- Intrinsic Motivation from Artificial Intelligence Feedback☆131Updated last year
- Training and inference code for "Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning"☆41Updated 8 months ago
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).☆68Updated 9 months ago
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆71Updated 2 years ago