josh-ashkinaze / pluralsLinks
Plurals: A System for Guiding LLMs Via Simulated Social Ensembles
☆22Updated last week
Alternatives and similar repositories for plurals
Users that are interested in plurals are comparing it to the libraries listed below
Sorting:
- ☆48Updated 3 weeks ago
- Data exports from select "open data" Polis conversations☆37Updated 8 months ago
- ☆95Updated last year
- In situ interactive widgets for responsible AI 🌱☆24Updated last year
- ☆69Updated last year
- Factored Cognition Primer: How to write compositional language model programs☆49Updated 2 years ago
- LLM plugin for clustering embeddings☆76Updated last year
- ☆97Updated 4 months ago
- The Prism Alignment Project☆77Updated last year
- Learning to route instances for Human vs AI Feedback (ACL 2025 Main)☆23Updated last month
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data☆95Updated 2 months ago
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆53Updated 3 months ago
- ☆134Updated 7 months ago
- ☆106Updated last year
- ☆29Updated last year
- Evaluating the Moral Beliefs Encoded in LLMs☆26Updated 6 months ago
- Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"☆51Updated 4 months ago
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆119Updated last year
- ☆16Updated 9 months ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆81Updated last year
- Psych 290Q S23 @ UC Berkeley: Large Language Models and Cognitive Science☆18Updated last year
- The Foundation Model Transparency Index☆81Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting".☆105Updated 2 weeks ago
- Automated Qualitative Analysis of LLMs (ICLR 2025)☆39Updated 2 months ago
- Governance of the Commons Simulation (GovSim)☆51Updated 5 months ago
- Repo for the paper "Detecting Logical Fallacies: From Quiz to Climate Change News" (2021)☆78Updated last year
- Run SWE-bench evaluations remotely☆21Updated last month
- PAIR.withgoogle.com and friend's work on interpretability methods☆192Updated this week
- Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM (CHI 2024 paper). LLooM automatically surfaces high-l…☆111Updated 3 weeks ago
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆106Updated 7 months ago