josh-ashkinaze / pluralsLinks

Plurals: A System for Guiding LLMs Via Simulated Social Ensembles

☆24

Alternatives and similar repositories for plurals

Users that are interested in plurals are comparing it to the libraries listed below

Sorting:

oughtinc / primer
Factored Cognition Primer: How to write compositional language model programs
☆49Updated 2 years ago
TransluceAI / observatory
A toolkit for describing model features and intervening on those features to steer behavior.
☆195Updated 9 months ago
forecastingresearch / forecastbench
A dynamic forecasting benchmark for LLMs
☆27Updated this week
compdemocracy / openData
Data exports from select "open data" Polis conversations
☆39Updated 10 months ago
minalee-research / coauthor-interface
☆95Updated last year
vinid / NegotiationArena
☆72Updated last year
KihoPark / LLM_Categorical_Hierarchical_Representations
☆104Updated 5 months ago
haizelabs / thorn-in-haizestack
Thorn in a HaizeStack test for evaluating long-context adversarial robustness.
☆26Updated last year
egozverev / Should-It-Be-Executed-Or-Processed
Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.
☆54Updated 5 months ago
anthropics / evals
☆287Updated last year
simonw / llm-cluster
LLM plugin for clustering embeddings
☆80Updated last year
centerforaisafety / emergent-values
Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"
☆54Updated 5 months ago
Data-Provenance-Initiative / Data-Provenance-Collection
☆245Updated 4 months ago
AZHenley / papers-ux-ai-programming
List of research papers of research papers investigating the user experience of AI-powered programming assistants (e.g., Copilot).
☆99Updated last year
giorgiopiatti / GovSim
Governance of the Commons Simulation (GovSim)
☆56Updated 6 months ago
haizelabs / dspy-redteam
Red-Teaming Language Models with DSPy
☆203Updated 5 months ago
expectedparrot / edsl
Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers…
☆261Updated this week
microsoft / mechanistic-error-probe
A mechanistic approach for understanding and detecting factual errors of large language models.
☆47Updated last year
invariantlabs-ai / explorer
A better way of testing, inspecting, and analyzing AI Agent traces.
☆39Updated last month
causalNLP / logical-fallacy
Repo for the paper "Detecting Logical Fallacies: From Quiz to Climate Change News" (2021)
☆78Updated last year
aypan17 / machiavelli
☆137Updated 2 weeks ago
PAIR-code / farsight
In situ interactive widgets for responsible AI 🌱
☆27Updated last year
ccs-ucb / llms-cogsci
Psych 290Q S23 @ UC Berkeley: Large Language Models and Cognitive Science
☆19Updated last year
haizelabs / sphynx
Sphynx Hallucination Induction
☆53Updated 6 months ago
michelle123lam / lloom
Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM (CHI 2024 paper). LLooM automatically surfaces high-l…
☆118Updated 2 months ago
causalNLP / cladder
We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.
☆119Updated last year
r-three / common-pile
Code for collecting, processing, and preparing datasets for the Common Pile
☆216Updated 2 weeks ago
haizelabs / verdict
Inference-time scaling for LLMs-as-a-judge.
☆272Updated 3 weeks ago
epfl-dlab / GPTurk
☆29Updated last year
HannahKirk / prism-alignment
The Prism Alignment Project
☆79Updated last year