josh-ashkinaze / pluralsLinks
Plurals: A System for Guiding LLMs Via Simulated Social Ensembles
☆24Updated last month
Alternatives and similar repositories for plurals
Users that are interested in plurals are comparing it to the libraries listed below
Sorting:
- Factored Cognition Primer: How to write compositional language model programs☆49Updated 2 years ago
- A toolkit for describing model features and intervening on those features to steer behavior.☆195Updated 9 months ago
- A dynamic forecasting benchmark for LLMs☆27Updated this week
- Data exports from select "open data" Polis conversations☆39Updated 10 months ago
- ☆95Updated last year
- ☆72Updated last year
- ☆104Updated 5 months ago
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Updated last year
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆54Updated 5 months ago
- ☆287Updated last year
- LLM plugin for clustering embeddings☆80Updated last year
- Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"☆54Updated 5 months ago
- ☆245Updated 4 months ago
- List of research papers of research papers investigating the user experience of AI-powered programming assistants (e.g., Copilot).☆99Updated last year
- Governance of the Commons Simulation (GovSim)☆56Updated 6 months ago
- Red-Teaming Language Models with DSPy☆203Updated 5 months ago
- Design, conduct and analyze results of AI-powered surveys and experiments. Simulate social science and market research with large numbers…☆261Updated this week
- A mechanistic approach for understanding and detecting factual errors of large language models.☆47Updated last year
- A better way of testing, inspecting, and analyzing AI Agent traces.☆39Updated last month
- Repo for the paper "Detecting Logical Fallacies: From Quiz to Climate Change News" (2021)☆78Updated last year
- ☆137Updated 2 weeks ago
- In situ interactive widgets for responsible AI 🌱☆27Updated last year
- Psych 290Q S23 @ UC Berkeley: Large Language Models and Cognitive Science☆19Updated last year
- Sphynx Hallucination Induction☆53Updated 6 months ago
- Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM (CHI 2024 paper). LLooM automatically surfaces high-l…☆118Updated 2 months ago
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆119Updated last year
- Code for collecting, processing, and preparing datasets for the Common Pile☆216Updated 2 weeks ago
- Inference-time scaling for LLMs-as-a-judge.☆272Updated 3 weeks ago
- ☆29Updated last year
- The Prism Alignment Project☆79Updated last year